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Multi-Antigen Vectors for Melanoma 

FIELD OF THE INVENTION 

The present invention relates to multi-antigen vectors for use in preventing and / or 
treating cancer. In particular, the invention relates to multi-antigen vectors for use in treating 
and/or preventing melanoma. 

BACKGROUND OF THE INVENTION 

There has been tremendous increase in last few years in the development of cancer 
vaccines with tumour-associated antigens (TAAs) due to the great advances in identification of 
molecules based on the expression profiling on primary tumours and normal cells with the help 
of several techniques such as high density raicroarray, SEREX, immunohistochemistry (IHC), 
RT-PCR, in-situ hybridization (ISH) and laser capture microscopy (Rosenberg, Immunity, 1999; 
Sgroi et al, 1999, Schena el 4 1995, Offringa et al, 2000)! The TAAs are antigens expressed or 
over-expressed by tumour cells and could be specific to one or several tumours for example CEA 
antigen is expressed in colorectal, breast and lung cancers. Sgroi et al (1999) identified several 
genes differentially expressed in invasive and metastatic carcinoma cells with combined use of 
laser capture microdissection and cDNA microarrays. Several delivery systems like DNA or 
viruses could be used for therapeutic vaccination against human cancers (Bonnet et al, 2000) and 
20 can elicit immune responses and also break immune tolerance against TAAs. Tumour cells can 
be rendered more immunogenic by inserting transgenes encoding T cell co-stimulatory 
molecules such as B7.1 or cytokines such as JFN-y, IL2, or GM-CSF, among others. Co- 
expression of a TAA and a cytokine or a co-stimulatory molecule can develop effective 
therapeutic vaccine (Hodge et al, 95, Bronte et al, 1995, Chamberlain et al, 1996). 

There is a need in the art for reagents and methodologies useful in stimulating an immune 
response to prevent or treat cancers. The present invention provides such reagents and 
methodologies that overcome many of the difficulties encountered by others in attempting to 
treat cancer. . 
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SUMMARY O F TTTE INVENTION 
The present invention provides multi-antigen vectors for administration to a patient to 
prevent and / or treat cancer. In particular, the multi-antigen vector encodes one or more tumor 
antigens ( l TA"). The multi-antigen vector may also encode an immune stimulator such as a co- 
5 stimulatory molecule and/or be administered with an adjuvant. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1. Schematic of plasraids pALVAC.Triconi(#33) and pT1132. 
Figure 2. DNA scqneBce of plasmid pAI-.VAC.Tricora(#33). 
10 Figure 3. DNA sequence of plasmid pTl 132. 
Figure 4. Schematic of plasmid pT3217. 
FigureS. DNA sequence of plasmid pT3217. 

Figure 6. Amino acid sequences of exemplary NY-ESO-1, TRP-2, gplOO, gplOOM, MART- 
.1, MAGE-1, MAGE-3, B7.1, LFA-3, and ICAM-1 proteins. 

15 

DETAILED DESCRIPTION 

The present invention provides reagents and methodologies useful for treating and / or. 

preventing cancer. All references cited within this application are incorporated by reference. 

In one embodiment, the present invention relates to the induction or enhancement of an 

20 immune response against one or more tumor antigens ("TA") to prevent and / or treat cancer. In 

certain embodiments, one or more TAs may be combined. In preferred embodiments, the 

immune response results from expression of a TA in a host cell following administration of a 

nucleic acid vector encoding the tumor antigen or the tumor antigen itself in the form of a 

peptide or polypeptide, for example. 

25 As used herein, an "antigen" is a molecule (such as a polypeptide) or a portion thereof 

that produces an immune response in a host to whom the antigen has been administered. The 

immune response may include the production of antibodies that bind to at least one epitope of the 

antigen and / or the generation of a cellular immune response against cells expressing an epitope 

of the antigen. The response may be an enhancement of a current immune response by, for 

" 30 example, causing increased antibody production, production of antibodies with increased affinity 

for the antigen, or an increase in the cellular immune response (i.e., increased number or activity 

2 
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of imimuioreactive T cells). An antigen that produces an immune response may alternatively be 
referred to .as being immunogenic or as an immunogen. In describing the present invention, a 
TA may be referred to as an "immunogenic target". The present invention provide expression 
vectors for expressing in a host one or more immunogenic targets. 
5 The term TA includes both tumor-associated antigens (TAAs) * and tumor-specific 

antigens (TSAs), where a cancerous cell is the source of the antigen, A TAA is. an antigen.that is 
expressed on the surface of a tumor cell in higher amounts than is observed on normal cells or an 
antigen that is expressed on normal cells during fetal development, A TSA is an antigen that is 
unique to tumor cells and is not expressed on normal cells. TA further includes TAAs or TSAs, 

10 antigenic fragments thereof, and modified versions that retain their antigenicity. 

TAs are typically classified into five categories according to their expression pattern, 
function, or genetic origin; cancer-testis (CT) antigens (i.e., MAGE, NY-ESO-1); melanocyte 
differentiation antigens (i.e., Melan A/MART- 1, tyrosinase, gplOO); mutational antigens (i.e., 
MUM-1, p53, CDK-4); overexpressed.'self antigens (i.e., HER-2/neu, p53); and, viral antigens 

15 (i.e.,. HPV, EBV). For the purposes of practicing the present invention, a suitable TA is any TA 
that induces or enhances an anti-tumor immune response in a host to whom the TA has been 
administered. Suitable TAs include, for example, species of gplOO (Cox et al., Science, 264:716- 
719 (1994); U.S. Pat. No. 6,500,919 Bl and WO 01/30847 with Val at residue 162, also referred 
to as "gplOOM"; U.S. Pat. No. 6,537,560 -Bl with Phe at residue 162), MART- 1 /Melan A 
.20 (JCawakami et al., J. Exp. Med., 180:347-352 (1994); U.S. Pat. No. 5,874,560), gp75 (TRP-l) 
(Wang et al., /. Exp. Med., 186:1131-1140 (1996)), TRP-2 (Wang et al. 1996 J. Exp. Med. 
384:2207; U.S. Pat. Nos. 5,831,016 and 6,083,783), tyrosinase (Wolfel et all, Eur. J. Immunol, 
24:759-764 (1994); WO 200175117; WO 200175016; WO 200175007), NY-ESO-1 (WO 
98/14464; WO 99/18206; GenBank Accession No. P78358; U.S. Pat. No: 5,804,381), melanoma 

25 proteoglycan (Hcllstrom et al., /. Immunol, 130:1467-1472 (1983)), MAGE family antigens 
(i.e., MAGE-1, 2,3,4,6,12, 51; Van der Bruggen et al., Science, 254:1643-1647 (1991); U.S. Pat. 
Nos. 6,235,525; CN 131961 1), BAGE family antigens (Boel et al., Immunity, 2:167-175 (1995)), 
GAGE family antigens (i.e., GAGE-1,2; Van den Eynde et al., 7, Exp. Med., 182:689-698 
(1995); U.S. Pat. No. 6,013,765), RAGE family antigens (i.e., RAGE-1; Gaugler et at, 

30 Immunogenetics, 44:323-330 (1996); U.S. Pat. No. 5,939,526), N-acetylglucosaminyltransferase- 

V (Guilloux et at., J. Exp. Med., 183:1173-1183 (1996)), pl5.(Robbins et al., J. Immunol 

3 
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154:5944-5950 (1995)), B-catenin (Robbins et al, J. Exp. Med., 1 83:1185-1 192 (1996)); MUM-1 
(Coulie et.ai:, Proc. Natl. Acad. Sci. USA, 92:7976-7980 (1995)), cyclin dependent kinase-4 
(CDK4) (Wolfel et al., Science, 269:1281-1284 (1995)), p21-r« (Fossum et at., Int. J. Cancer, 
56:40-45 (1994)), BCR-aW (Bocchia et al., Blood, 85:2680-2684 (1995)), P 53 (Theobald et al., 
Proc. mil Acad. Sci. USA, 92:11993-11997 (1995)), P 185 HER2/neu (erb-Bl; Fisk et al., J. 
Exp. Med., 181:2109-2117 (1995)), epidermal growth factor receptor (EGFR) (Harris et al., 
Breast. Cancer Res.' Treat, 29:1-2 (1994)), carcinoembryonic antigens (CEA) (Kwong et al., / 
Natl. Cancer Inst., 85:982-990 (1995) U.S. Pat. Nos. 5,756,103; 5,274,087; 5,571,710; 
6,071,716; 5,698,530; 6,045,802; EP 263933; EP 346710; and, EP 784483); carcinoma- 
associated mutated mucins (i.e., MUC-1 gene products; Jerome et al., J. Immunol., 151:1654- 
1662 (1993)); EBNA gene products of EBV (i.e., EBNA-1; Rickinson et al., Cancer Surveys, 
13:53-80 (1992)); E7, E6 proteins of human papillomavirus (Ressing et al., J. Immunol, 
154:5934-5943 (1995)); prostate specific antigen (PSA; Xue et al., Tlie Prostate,- 30:73-78 
(1997)); prostate specific membrane antigen (PSMA; Israeli, et al., Cancer Res., 54:1807-1811 
15 (1994)); idiotypic epitopes or antigens, for example, immunoglobulin idiotypes or T cell receptor 
idiotypes (Chen et al.,./. Immunol., 153:4775-4787 (1994)); KSA (U.S. Patent No. 5,348,887), 
kinesin 2 (Dietz, et al. Biochem Bibphys Res Commun 2000 Sep 7;275(3):731-8), HIP-55, 
TGFP-1 anti-apoptotic factor (Toomey, et al. Br J Biomed Sci 2001 ;58(3): 177-83), tumor protein 
D52 (Bryne J.A., et al., Genomics, 35:523-532 (1996)), H1FT, NY-BR-1 (WO.01/47959), NY- 
20 BR-62, NY-BR-75, NY-BR-85, NY-BR-87, NY-BR-96 (ScanJan, M. Serologic and 
Bioinformatic Approaches to the Identification of Human Tumor Antigens, in Cancer Vaccines 
2000, Cancer Research Institute. New York, NY), including "wild-type" (i.e:, normally encoded 
by the genome, naturally-occurring), modified, and mutated versions as well as other fragments 
and derivatives thereof. Any of these TAs may be utilized alone or in combination with one 
25 another in a co-immunization protocol. 

Preferred TAs are useful for inducing an immune response against melanoma cells. The 
term "melanoma" includes but is not limited to melanomas, metastatic melanomas, melanomas 
derived from either melanocytes or melanocyte related nevus cells, melanocarcinomas, 
myoepitheliomas, melanosarcomas, melanoma in situ, superficial spreading melanoma, 
30 nodular melanoma, lentigo maligna melanoma, acral lentiginous melanoma, invasive melanoma 
and familial atypical mole and melanoma (FAM-M) syndrome, for example. In general, 
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melanomas result from chromosomal abnormalities, degenerative growth and development 
disorders, mitogenic agents, ultraviolet radiation (UV), viial infections, inappropriate tissue 
expression of a gene, alterations in expression of a gene or carcinogenic agents, for example. 

In certain coses, it may be beneficial to co-immunize patients with both TA and other 

5 antigens, such as angiogenesis-associated antigens ("AA")- An AA is an immunogenic molecule 
(i.e., peptide, polypeptide) associated with cells involved in the induction and / or continued 
development of blood vessels. For example, an AA may be expressed on an endothelial cell 
("EC), which is a primary structural component of blood vessels. Where the cancer isxancer, it 
is preferred that that the AA be found within or near blood vessels that supply a tumor. 

10 Immunization of a patient against an AA preferably results in an anti-AA immune response 
whereby angiogenic processes that occur near or within tumors are prevented and / or inhibited. 
Exemplary AAs include, for example, vascular endothelial growth factor (i.e., VEGF; 
Bernardini, et al. J. Urol., 2001, 166(4): 1275-9; Starnes, et al. J. Thorac. Cardiovasc. Surg., 
2001, 122(3): 518-23; Dias,"ct al. Blood, 2002, 99: 2179-2184), the VEGF receptor (i.e., VEGF- 

15 R, flk-l/KDR; Starnes, et al. ./. Tliorac. Cardiovasc. Surg., 2001, 122(3): 518-23), EPH receptors 
(i.e., EPHA2; Gerety, et al. 1999, Cell, 4: 403-414), epidermal growth factor receptor (i.e., 
EGFR; CiardciJIo, el al. Clin. Cancer Res., 2001, 7(10): 2958-70), basic fibroblast growth factor 
(i.e., bFGF; Davidson, ct al. Clin. Exp. Metastasis 2000,18(6): 501-7; Poon, et al. Am J. Surg., 
2001, 182(3):298-304), platelet-derived cell growth factor (i.e., PDGF-B), platelet-derived 

20 endothelial cell growth factor (PD-ECGf; Hong, et al. J. Mol. Med., 2001, 8(2):141-8), 
transforming growth factors (i.e., TGF-a; Hong, et al. J. Mol. Med., 2001, 8(2): 141-8), endoglin 
(Balza, et al. Int. J. Cancer, 2001, 94: 579-585), Id proteins (Benezra, R. Trends Cardiovasc. 
Med., 2001, 11(6):237-41), proteases such as uPA, uPAR, and matrix metalloproteinases (MMP- 
2, MMP-9; Djonov, et al. J. Pathol., 2001, 195(2): 147-55), nitric oxide synthase (Am. J. 

25 Ophthalmol., 2001, 132(4):551-6), aminopeptidase (Rouslhati, E. Nature Cancer, 2: 84-90, 
2002), thrombospondins (i.e., TSP-1, TSP-2; Alvarez, et al. Gynecol. Oncol., 2001, 82(2):273-8; 
Selci, et al. Int. J. Oncol., 2001, 19(2):305-10), k-ras (Zhang, et al. Cancer Res., 2001, 
. 61(16):6050-4), Wnt (Zhang, et al. Cancer Res., 2001, 61(16):6050-4), cyclic-dependent kinases 
(CDKs; Drug Resist. Updat. 2000, 3(2):83-S8), microtubules (Timar, et al. 2001. Path. Oncol 

30 Res., 7(2): 85-94), heat shock proteins (i.e., HSP90 (Timar, supra)), heparin-binding factors (i.e., 

heparinase; Gohji, et al. Int. J. Cancer, 2001, 95(5):295-301), synthases (i.e., ATP synthase, 

5 
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thymidilate synthase), collagen receptors, integrins (i.e., ao|33, ao(J5, aipi, a2pi,a5pl), the 
surface proteoglycan NG2, AAC2-1, or AAC2-2, among others, including "wild-type" (i.e., 
normally encoded by the genome, naturally-occurring), modified, mutated versions as well as 
other fragments' and derivatives thereof. Any of these targets may be suitable in practicing the 
5 present invention, either alone or in combination with one another or with other agents. 

The nucleic acid molecule may comprise or consist of a nucleotide sequence encoding 
one or more immunogenic targets, or fragments or derivatives thereof, such as that contained in a . 
DNA insert in an ATCC Deposit. The term "nucleic acid sequence" or "nucleic acid molecule" 
refers to a DNA or RNA sequence. The term encompasses molecules formed from any of the 
10 known base analogs of DNA and RNA such as, but not limited to 4-acetylcytosine, 8-hydroxy- 
N6-methyladenosine, aziridinyl-cytosine, pseudoisocytosine, 5-(carbbxyhydroxylmethyl) uracil, 
5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouraci!, S-carboxy- 
• methylaminomethyluracil, dihydrouracil, inosine, N6-iso-pentenyladenine, 1-methyladenine, 1- 
methylpseudouracil, 1 -melhylguanine, l-methy)inosine, 2,2-dimethyl-guanine, 2-methyladenine, 
15 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine, 7-methylguanine, S- 
methylaminomethyluracil, 5-methoxyamino-methyl-2-thiouracil, beta-D-mannosylqueosine, 5' - 
methoxycarbonyl-methyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, N-uracil-5- 
,20 oxyacetic acid methylester, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and 
2,6-diaminopurine, among others. 

An isolated nucleic acid molecule is one that: (1) is separated from at least about 50 
percent of proteins, lipids, carbohydrates, or other materials with which it is naturally found 
when total nucleic acid is isolated from the source cells; (2) is not be linked to all or a portion of 
25 a polynucleotide to which the nucleic acid molecule is linked in nature; (3) is operably linked to 
• a polynucleotide which it is not linked to in nature; and / or, (4) does not occur in nature as part 
of a larger polynucleotide sequence. Preferably, the isolated nucleic acid molecule of the present 
invention is substantially free from' any other contaminating nucleic acid molecule(s) or other- 
contaminants that are found in its natural environment that would interfere With its use in 
30 polypeptide production or its therapeutic, diagnostic, prophylactic or research use. As used 
herein, the term "naturally occurring" or "native" or "naturally found" when used in connection 
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with biological materials such as nucleic acid molecules, polypeptides, host cells, and the like, 
refers to materials which are found in nature and are not manipulated by man. Similarly, "non- 
naturolly occurring" or "non-native" as used herein refers to a material that is not found in nature 
or that has been structurally modified or synthesized by man. 
5 The identity of two or more nucleic acid or amino acid sequences is determined by 

comparing the sequences. As known in the art, "identity" means the degree of sequence 
relatedness between nucleic acid or amino acid sequences as determined by the match between 
the units making up the molecules (i.e., nucleotides or amino acid residues). Identity measures 
the percent of identical matches between the smaller of two or .more sequences with gap 

10 alignments (if any) addressed by a particular mathematical model or computer program (i.e., an 
algorithm). Identity between nucleic acid sequences may also be determined by the ability of the 
'" nucleic acid sequences to hybridize to one another. In defining the process of hybridization, the 
term "Highly stringent conditions" and "moderately stringent conditions" refer to conditions that 
permit- hybridization of nucleic acid strands whose sequences are complementary, and to exclude 

15 hybridization of significantly mismatched nucleic acids. Examples of "highly stringent 
conditions" for hybridization and washing are 0.015. M sodium chloride, 0.0015 M sodium 
citrate at 65-68°C or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 50% formamide at 
42°C. (see, for example, Sambrook, Fritsch '& Maniatis, Molecular Cloning: A Laboraloiy 
' Manual (2nd ed., Cold Spring Harbor Laboratory, 1989); Anderson el a/., Nucleic Acid 

20 Hybridisation: A Practical Approach Ch. 4 (IRL Press Limited))'. The term "moderately 
stringent conditions" refers to conditions under which a DNA duplex with a greater degree of 
base pair mismatching than could occur under lt highly stringent conditions" is able to form. 
Exemplary moderately stringent conditions are 0.015 M sodium chloride, 0.0015 M sodium^ 
citrate at 50-65°C or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 20% formamide at 

25 37-50°C. By way of example, moderately stringent conditions of 50°C in 0.015 M sodium ion 
will allow about a 21% mismatch. During hybridization, other agents may be included in the 
hybridization and washing buffers for the puipose of reducing non-specific and/or background 
. hybridization. Examples are 0.1% bovine serum albumin, 0.1% polyvinylpyrrolidone, 0.1% 
sodium pyrophosphate, 0.1% sodium dodecylsulfate, NaDodS0 4 , (SDS), ficoll, Denhardt's 

30 solution, sonicated salmon sperm DNA (or another non-complementary DNA), and dextran' 

sulfate, although other suitable agents can also- be used. The concentration and types of these 

7 
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additives can be changed without substantially affecting the stringency of the hybridization 
conditions.. Hybridization experiments are usually carried out at pH 6.8-7.4; however, at typical 
ionic strength conditions, the rate of hybridization is nearly independent of pH. 

In preferred embodiments of the present invention, vectors are used to transfer a nucleic 

5 acid sequence encoding an immunogenic target to a cell. A vector is any molecuie used to 
transfer a nucleic acid sequence to a host cell. In certain cases, an expression vector is utilized. 
An expression vector is a nucleic acid molecule that is suitable for transformation of a host cell . 
and contains nucleic acid sequences that direct and / or control the expression of the transferred 
nucleic acid sequences. Expression includes, but is not limited to, processes such as 

10 transcription, translation, and splicing, if introns are present. Expression vectors typically 
comprise one or more flanking sequences operably linked to a heterologous nucleic acid 
sequence encoding a polypeptide. Flanking sequences may be homologous (i.e., from the same 
• species and / or strain as the host cell), heterologous (i.e., from a species other than the host cell 
species or strain), hybrid (i.e., a combination of flanking sequences from more than one source), 

15 or synthetic, for example. 

A flanking sequence is preferably capable of effecting the replication, transcription and / 
or translation of the coding sequence and is operably linked to a coding sequence. As used 
herein, the term operably linked refers to a linkage of polynucleotide elements in a functional 
relationship. For instance, a promoter or enhancer is"- operably linked to a coding sequence if it 
.20 affects the transcription of the coding sequence. However, a flanking sequence need not 
necessarily be contiguous with the coding sequence, so long as it functions correctly. Thus, for 
example, intervening untranslated yet transcribed sequences can be present between a promoter 
sequence "and the coding sequence and the promoter sequence may still be.considered operably 
linked to the coding sequence. Similarly, an enhancer sequence may be located upstream or 

25 downstream from the coding sequence and affect transcription of the sequence. 

In certain embodiments, it is preferred that the flanking sequence is a - transcriptional 
regulatory region that drives high-level gene expression in the target cell. The transcriptional 
regulatory region may comprise, for example, a promoter, enhancer, silencer, repressor element, 
or combinations thereof. The transcriptional regulatory region may be either constitutive, tissue- 

30 specific, cell-type specific (i.e., the region is drives higher levels of transcription in a one type of 

tissue or cell as compared to another), or regulatable (i.e., responsive to interaction with a 
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compound such as tetracycline). The source of a transcriptional regulatory region may be any 

prokaryotic of eukaryotic organism, any vertebrate or invertebrate organism, or any plant, 

provided that the flanking sequence functions in a cell by causing transcription of a nucleic acid 

within that cell. A wide variety of transcriptional regulatory regions may be utilized in practicing 

S the present invention. 

Suitable transcriptional regulatory regions include the CMV promoter (i.e., the CMV- 

immediate early promoter); promoters from eukaryotic genes (i.e., the estrogen-inducible 

chicken ovalbumin gene, the interferon genes, the gluco-corticoid-inducible tyrosine 

aminotransferase gene, and the thymidine kinase gene); and the major early and late adenovirus 

10 gene promoters; the SV40 early promoter region (Bernoist and Chambon, 1981, Nature 290:304- 

10); the promoter contained in the 3* long terminal repeat (LTR) of Rous sarcoma virus (RSV) 

' (Y amamoto, et al, 1980, Cell 22:787-97); the herpes simplex virus thymidine kinase (HSV-TK) 

promoter (Wagner et al, 19S1, Proc. Natl Acad, ScL U.S.A. 78:1444-45); the regulatory 

• sequences of the metallothionine gene (Brinster ef al, 1982, Nature 296:39-42); prokaryotic 

1*5 • expression vectors such as the beta-lactamase promoter (Villa-Kamaroff et al, 1978, Proc, Natl 

Acad. ScL USA, 75:3727-31); or the tac promoter (DeBoer et al, 1983', Proc. Nail. Acad. ScL 

U.S.A., 80:21-25). Tissue- and / or cell-type specific transcriptional control regions include, for 

example, the elastase I gene control region which is active in pancreatic acinar cells (Swift el al, 

1984, Cell 38:639-46; Ornitz et al, 1986, Cold Spring Harbor Symp. QuanL Biol. 50:399-409 

20 (1986); MacDonald, 1987, Hepatology 7:425-515); the insulin gene control region which is 

active in pancreatic beta cells (Hanahan, 1985, Nature 3} 5:11 S-22); the immunoglobulin gene. 

control region which is active in lymphoid cells (Grosschedl et al y 1984, Cell 38:647-5S; 

Adamcsera/., 1 985, Nature 3 18:533-38; Alexander etal,, 19S7,M>/. Cell. Biol, 7:1436-44); the 

mouse mammary tumor virus control region in testicular, breast, lymphoid and mast cells (Leder 

25 eial, 1986, Cell 45:485-95); the albumin gene control region in liver (Pinkert <?/ al, 1987, Genes 

and Devel 1:268-76); the alpha-feto-protein gene control region in liver (Krumlauf et al, 1985, 

Mol Cell Biol, 5:1639-48; Hammer et al, 1987, Science 235:53-58); the alpha 1-antitrypsin 

gene control region in liver (Kelsey et al, 1987, Genes and Devel. 1:161-71); the beta-globin 

gene control region in myeloid cells (Mogram et al, 1985, Nature 315:338-40; Kollias et al, 

30 1986, Cell 46:89-94); the myelin basic protein gene control region in oligodendrocyte cells in the 

brain (Readhead et al, 1987, Cell 48:703-12); the myosin light chain-2 gene control region in 
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skeletal muscle (Sani, 1985, Nature 314:283-86); the gonadotropic releasing hormone gene 
control region in the hypothalamus (Mason. et al. h 1986, Science 234:1372-78), and the 
. tyrosinase promoter in melanoma cells (Hart, I. Semin Oncol 1996 Feb;23(l): 154-8; Siders, et al. 
Cancer Gene Ther 1998 Sep-Oct;5(5):281-91), among others. Inducible promoters that are 
5 activated in the presence of a certain compound or condition such as light, heat, radiation, 
tetracycline, or heat shock proteins, for example, may. also be utilized (see, for example, WO 
00/1 06 1 2). Other suitable promoters are known in the art. 

' As described above, enhancers may also be suitable flanking sequences. Enhancers are 
cis-acting elbments of DNA, usually about 10-300 bp in length, that act on the promoter to 
J0 increase transcription. Enhancers are typically orientation- and position-independent, having 
been identified both 5' and 3' to controlled coding sequences. Several enhancer sequences 
• available from mammalian genes are known (i.e., globin,-elastase, albumin, alpha-feto-protein 
and insulin). Similarly, the SV40 enhancer, the cytomegalovirus early promoter enhancer, the 
' polyoma enhancer, and adenovirus enhancers are useful with eukaryotic promoter sequences. 
15 • While an enhancer may be spliced into the vector at a position 5* or 3' to nucleic acid coding 
sequence, it is typically located at a site 5' from the promoter. Other suitable enhancers are 
known in the art, and would be applicable to the present invention. 

While preparing reagents of the present invention, cells may need to be transfected or 
transformed. Transfection refers to the uptake of foreign or exogenous DNA by a cell, and a cell 
20 has been transfected when the exogenous DNA has been introduced inside the cell membrane. A 
number of transfection techniques are well known in the art (i.e., Graham et aL, 1973, Virology 
52:456; Sambrook et aL t Molecular Cloning, A Laboratory Manual (Cold Spring Harbor 
Laboratories, 1989);. Davis et al, Basic Methods in Molecular Biology (Elsevier, 1986); and Chu 
et al., 1981, Gene 13:197). Such techniques can be used to introduce one or more exogenous 
25 DNA moieties into suitable host cells. 

In certain embodiments, it is preferred that transfection of a cell results in transformation 
of that cell. A cell is transformed when there is a change in a characteristic of the cell, being 
transformed when it has been modified to contain a new nucleic acid.. Following transfection, 
the transfected nucleic acid may recombine with that of the cell by physically integrating into a 
30 chromosome of the cell, may be maintained transiently as an episomal element without being 

10 



BNSOOCtD: <WO___200S020370A2_I_> 



WO 2005/(126370 



PCT/US2O04/02K7M 



replicated, or may replicate independently as a plasmid. A cell is stably transformed when the 
nucleic acid is replicated with the division of the cell. 

The expression vectors of the present invention also provide for expression of fragments 
of immunogenic targets. Fragments may include sequences truncated at the amino terminus 
5 (with or without a leader sequence) and / or the carboxy terminus. Fragments may also include 
variants (i.e., allelic, splice), orthologs, homologues, and other variants having one or. more 
amino acid additions or substitutions or internal deletions as compared to the parental sequence. 
In preferred embodiments, truncations and/or deletions comprise about 1-5 amino acids, 5- IX) 
amino acids, 1.0-20 amino acids, 20-30 amino acids, 30-40 amino acids, 40-50 amino acids, or 
10 more. Such polypeptide fragments may optionally comprise an amino terminal methionine 
residue. It will be appreciated that such fragments can be used, for example, to generate 
antibodies or cellular immune responses to immunogenic targets. 

A variant is a sequence having one or more sequence substitutions, deletions, and/or 
additions as compared to the subject sequence. Variants may be naturally occurring or 
.15 artificially constructed. Such variants may be prepared from the corresponding nucleic acid 
molecules. In preferred embodiments, the variants have from 1 to 3, or from 1 to 5, or from 1 to 
1 0, or from 1 to 15, or from I to 20, or from 1 to 25, or from 1 to 30, or from 1 to 40, or from 1 
to 50, or more than 50 amino acid substitutions, insertions, additions and/or deletions. 

An allelic variant is one of several possible naturally-occurring alternate forms of a 
. 20 sequence occupying a given locus on a chromosome of an organism or a population of 
organisms. A splice variant is a polypeptide generated from one of several RN A . transcript 
resulting from splicing of a primary transcript. An ortholog is a similar nucleic acid or 
polypeptide sequence from another species. For example, the mouse and human versions of an 
immunogenic target. may be considered orthologs of each other. A derivative of a sequence is 
25 one that is derived from a parental sequence those sequences having substitutions, additions, 
deletions, or chemically modified variants. Variants may also include fusion proteins, which 
refers to the fusion of one or more first sequences (such as a peptide) at the amino or carboxy 
terminus of at least one other sequence (such as a heterologous peptide). 

"Similarity" is a concept related to identity, except that similarity refers to a measure of 
30 relaledness which includes both identical matches and conservative substitution matches. If two 

polypeptide sequences have, for example, 10/20 identical amino acids, and the remainder are all 
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non-conservative substitutions, then the percent identity and similarity would both be 50%. If in 
the same example, there are five more positions where there are conservative substitutions, then 
the percent identity remains 50%, but the percent similarity would be 75% (1 5/20). Therefore, in 
cases where there are conservative substitutions, the percent similarity between two polypeptides 
will be higher than the percent identity between those two polypeptides. 

Substitutions may be conservative, or non-conservative, or any combination thereof. 
Conservative amino acid modifications to the sequence of a polypeptide (and the corresponding 
modifications to the encoding nucleotides) may produce polypeptides having functional and 
chemical characteristics similar to those of a parental' polypeptide. For example, a "conservative 
amino acid substitution" may involve a substitution of a native amino acid residue with a non- 
native residue such that there is little or no effect on the size, polarity, charge, hydrophqbicity, or 
•hydrophilicity of the amino acid residue at that position and, in particlar, does not result in 
decreased immunogenicity. Suitable conservative amino acid substitutions are shown in Table I. 



Table 1 



Original 
Residues 


Exemplary Substitutions 


Preferred 
Substitutions 


Ala 


Val, Leu, lie 


Val 


Arg 


Lys, Gin, Asn 


Lys 


Asn 


Gin 


Gin 


Asp 


Glu 


Glu 


Cys 


Ser, Ala 


Ser 


Gin 


Asn 


Asn 


Glu 


Asp 


Asp 


Gly 


Pro, Ala 


Ala 


His 


Asn, Gin, Lys, Arg 


Arg 


He 


Leu, Val, Met, Ala, Phe, Norleucine 


Leu 


Leu 


Norleucine. He, Val, Met, Ala, Phe 


lie 


Lys 


Arg, 1,4 Diamino-butyric Acid, Gin, Asn 


Arg 


Met 


[ Leu, Phe, He 


Leu 


Phe 


Leu, Val. He, Ala, Tyr 


Leu 


Pro 


Ala 


Gly 


Ser 


Thr, Ala, Cys 


Thr 


• Thr 


Ser 


Ser 


-Trp 


Tyr, Phe 


Tyr 


' Tyr 


• Trp. Phe; Thr, Ser . 


Phe 


Val 


lie. Met. Leu, Phe, Ala, Norleucine 


Leu 
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A skilled artisan will be able to determine suitable variants of an immunogenic target 
using well-known, techniques. For identifying suitable areas of the molecule that may be 
changed without destroying biological activity (i.e., MHC binding, immunogenicity), one skilled 
5 in the art may target areas not believed to be important for that activity. For example, when 
immunogenic targets with similar activities from the same species or from other species are 
known, one skilled in the art may compare the amino acid sequence of a polypeptide to such 
similar polypeptides. By performing such analyses, one can identify residues and portions of the 
molecules that are conserved. It will be appreciated that changes in areas of the molecule that 
10 are not conserved relative to such similar immunogenic targets would be less likely to adversely 
affect the biological activity and/or structure of a polypeptide. Similarly, the residues Fequircd 
for binding to MHC are known, and may be modified to improve binding. However, 
modifications resulting in decreased binding to MHC will not be appropriate in most situations. 
One skilled in the art would also know that, even in relatively -conserved regions, one may 

.15 substitute chemically similar amino acids for the naturally occurring residues while retaining 
activity. Therefore, even areas that may be important for biological activity or for structure may 
be subject to conservative amino acid substitutions without destroying the biological activity or 
without adversely affecting the structure of the immunogenic target. 

Other preferred polypeptide variants include glycosylation variants wherein the number 

.20 ' and/or type of glycosylation sites have been altered compared to the subject amino acid 
sequence. In one embodiment, polypeptide variants comprise a greater or a lesser number of N- 
linked glycosylation sites than the subject amino acid sequence. An N-linked glycosylation site 
is characterized by the sequence Asn-X-Ser or Asn-X : Thr, wherein the. amino acid residue 
designated as X may be any amino acid residue except proline. The substitution of amino acid 

25 residues to create this sequence provides a potential new site for the addition of an N-linked 
carbohydrate chain. Alternatively, substitutions that eliminate this sequence* will remove an 
existing N-linked carbohydrate, chain. Also provided is a rearrangement of N-linked 
carbohydrate chains wherein one or more N-linked glycosylation sites (typically those that are 
naturally occurring) are eliminated and one or more new N-linked sites are created. To. affect O- 

30 linked glycosylation of a polypeptide, one would modify serine and / or threonine residues. 

13 
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Additional prefeired variants include cysteine variants, wherein one or more cysteine 
residues are deleted or substituted with another amino acid (e.g., serine) as compared to the 
subject amino acid sequence set. Cysteine variants are useful when peptides or polypeptides 
must be refolded into a biologically active conformation such as after the isolation of insoluble 
5 inclusion bodies. Cysteine variants generally have fewer cysteine residues than the native 
protein, and typically have ah even number to minimize interactions resulting from unpaired 
cysteines. 

• In other embodiments, the peptides or polypeptides may be attached to one or more 
fusion segments 'that assist in purification of the polypeptides. Fusions can be made either at the 

10 amino terminus or at the carboxy terminus of the subject polypeptide variant thereof. Fusions 
may be direct with no linker or adapter molecule or may be through" Blinker or adapter molecule. 
* A linker or adapter molecule may be one or more amino acid residues, typically from about 20 to 
about 50 amino acid residues. A linker or adapter molecule may also be -designed with a 
• cleavage site for a DNA restriction endonuclease or for a protease to allow for the separation of 

15 the ftjsed moieties. It will be appreciated that once constructed; the fusion polypeptides can be 
derivatized according to the methods described herein, Suitable fusion segments include, among 
others, metal binding domains (e.g., a poly-histidine segment), immunoglobulin binding domains 
(i.e., Protein A, Protein G, T cell, B cell, Fc receptor, or complement protein antibody-binding 
domains), sugar binding domains (e.g., a maltose binding domain), and/or a "tag" domain (i.e., at 

20 least a portion of a-galactosidase, a strep tag peptide, a T7 tag peptide, a FLAG peptide, or other 
domains that can be purified using compounds that bind to the domain, such as monoclonal 
antibodies). This tag is typically fused to the peptide or polypeptide and upon expression may 
serve as a means for affinity purification of the sequence of interest polypeptide from the host 
cell. "Affinity purification can be accomplished, for example, by column chromatography using 

25 antibodies against the tag as an affinity matrix. Optionally, the tag can subsequently be removed 
from the purified sequence of interest polypeptide by various means such as using certain 
peptidases for cleavage.. As described below, fusions may also be made between a TA and a co- 
. stimulatory components such as the chemokines CXC10 (IP- 10), CCL7 (MCP-3), or CCL5 
(RANTES), for example. 

30 A fusion motif may enhance transport of an immunogenic target to an MHC processing 

compartment, such as the endoplasmic reticulum. These sequences, referred to as tranduction or 
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transcytosis sequences, include sequences derived from HIV tat (see Kim et al. 1997 J. Immunol. 
159:1 6(>6) > .Drosophila antennapedia (see Schutze-Redelmeier et al. 1996 J. Immunol. 1 57:650), 
or human period- 1 protein (hPERl ; in particular, SRRHHCRSKAKRSRHH). 

In addition, the polypeptide or variant thereof may be fused to a homologous peptide or 

5 polypeptide to form a homodimer or to a heterologous peptide or polypeptide to form a 
heterodimen Heterologous peptides and polypeptides include, but are not limited to an epitope 
to allow for the detection and/or isolation of a fusion polypeptide; a transmembrane receptor . 
protein or a portion thereof, such as an extracellular domain or a transmembrane and intracellular 
domain; a ligand or a portion thereof which binds to a transmembrane receptor protein; an 

10 enzyme or portion thereof which is catalytically active; a polypeptide or peptide which promotes 
oligomerization, such as a leucine zipper domain; a polypeptide or peptide which increases 
stability, such as an immunoglobulin constant region; apeptide or polypeptide which has a 
therapeutic activity different from the peptide or polypeptide; and/or variants thereof. 

In certain embodiments, it may be advantageous to combine a nucleic acid sequence 

15 encoding an immunogenic target with one or more co-stimulatory component(s) such as cell 
surface proteins, cytokines or chemokines in a composition of the present invention. The co- 
stimuJatory component may be included in the composition as a polypeptide or as a nucleic acid 
encoding the polypeptide, for example. Suitable co-stimulatory molecules include, for instance, 
polypeptides that bind members of the CD28 family (i.e., CD28, 1COS; Hutloff, et al. Nature 

20 1 999, 397: 263-265; Peach, et al. J Exp Med 1994, 1 80: 2049-2058) such as the CD28 binding 
polypeptides B7.1 (CD80; Schwartz, 1992; Chen et al, 1992; Ellis, et al. - J. Immunol, 156(8): 
2700-9), B7.2 (CD86; Ellis, et al. I Immunol, 156(8): 2700-9), and mutants / variants thereof 
(WO 00/66162); polypeptides which bind members of the integrin family (i.e., LEA-1 (CDila / 
CD18); Sedwick, et al. J Immunol 1999, 162: 1367-1375; Wulfmg, et al. Science 1998, 282: 

25 2266-2269; Lub, et al. Immunol Today 1995, 16: 479-483) including members of the ICAM 
. family (i.e., ICAM-1, -2 or -3); polypeptides which bind CD2 family members (i;e., CD2, 
signalling lymphocyte activation molecule (CDwl50 or "SLAM"; Aversa, et al. 
J Immunol 1997, 158: 4036-4044)) such as CD58 (LFA-3; CD2 ligand; Davis, et al. Immunol 
Today 1996, 17: 177-187) or SLAM ligands (Sayos, et.al. Nature 1998, 395: 462-469); 

30 polypeptides which bind heat stable antigen (HSA or CD24; Zhou, et al. Eur J Immunol 1997, 

27: 2524-2528); polypeptides which bind to members of the TNF receptor (TNFR) family- (i.e., 
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4-1BB (CD 137; Vinay, et al. Semi* Immunol 1998, 10: 481-489), OX40 (CD134; Weinberg, et 
al. Senun Immunol 1998, 10: 471-480; Higgins/et al. J Immunol 1999, 162: 486-493),. and 
CD27 (Lens, et al. Senun Immunol 1998, 10: 491-499)) such as 4-1BBL (4-1 BB ligand; Vinay, 
et al. Semin Immunol 1998, 10: 481-48; DeBenedette, et al. J Immunol 1997, 158: 551-559), 
5 TNFR associated factor-1 (TRAF-1; 4-1BB ligand; Saoulli, et al. J Exp Med 1998, 187: 1849- 
1862, Arch, et al. Mol Cell Biol 1998, 18: 558-565), TRAF-2 (4-1BB and OX40 .ligand'; Saoulli, 
et al. J Exp Med 1998, 187: 1849-1862; Oshima, et al. Int Immunol 1998, 10: 517-526,. 
Kawamata, el al. J Biol Chem 1998, 273: 5808-5814), TRAF-3 (4-lBB .and OX40 ligand; Arch, 
et al. . Mol Cell Biol 1998, 1 8: 558-565; Jang, et al. Biochem Biophys Res Commun 1998, 242: 
10 613-620; Kawamata S, et al. / Biol Chem 1998, 273; 5808-5814),. OX40L (OX40 ligand; 
Gramaglia, et al. J Immunol 1998, 161: 6510-6517), TRAF-5 (OX40 ligand; Arch, et al. Mol 
Cell Biol 1998, 18: 558-565; Kawamata, et al. J Biol Client 1998, 273: 5808-5814), and CD70 
(CD27 ligand; Couderc, et a!. Cancer Gene The,:, 5(3): 163-75). ' CD134 (CD40 ligand or 
"CD40L"; Gurunathan, et al. J. Immunol,, 1998, 161: 4563-4571; Sine, et al. Hum. Gene Titer., 
.15 2001, 12: 1091-1 102) may also be suitable. 

. One or more cytokines may also be suitable co-stimulatory components or "adjuvants", 
either as polypeptides or being encoded by nucleic acids contained within the compositions of 
the present invention (Parmiani, et al. Immunol Lett 2000 Sep 15; 74(1): 41-4; Berzofsky, et al. 
Nature Immunol. 1: 209-219). Suitable cytokines include, for example, interleukin-2 (1L-2) 
20 • (Rosenberg, et al. Nature Med. 4: 321-327 (1998)), IL-4. IL-7, TL-12 (reviewed by Pardpll, 
1992; Harries, et al. J. Gene Med. 2000 Jul-Aug;2(4):243-9; Rao, et al. J: Immunol. 156: 3357- 
3365 (1996)), IL-15 (Xin, et al. Vaccine, 17:858-866, 1999), 1L-16 (Cruikshank, etal. J. Leuk 
Biol. 67(6): 757-66, 2000), IL-18 {J. Cancer Res. Clin. Oncol. 200L 127(12): 718-726), GM- 
CSF (CSF (Disis, et al. Blood, 88: 202-210 (1996)), tumor necrosis factor-alpha (TNF-d), or 
25 interferons such as IFN-a or INF-7. Other cytokines may also be suitable for practicing the 
present invention, as is known in the art. 

Chemokines may also be utilized, in either polypeptide or nucleic acid forni. Fusion 
proteins comprising CXCL10 (IP-10) and C.CL7 (MCP-3) fused to a tumor self-antigen have 
been shown to induce anti-tumor immunity (Bimgyn, et al. Nature Biotech. 1999, 17: 253-258). 
30 . The chemokines CCL3 (MJP-la) and CCL5 (RANTES) (Boyer, et al; Vaccine, 1999, 17 (Supp. 
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2): S53-S64) may also be of use in practicing the present invention. Other suitable chemokines 
are known in the art. 

It is also knowu in the art that suppressive or negative regulatory immune mechanisms 
may be blocked, resulting in enhanced immune responses. For instance, treatment with anti- 
5 CTLA-4 (Shrikant, et al. Immunity, 1996, 14: 145-155; Sutmuller, et al. J. Exp. Med., 2001, 1 
194: 823-832), anti-CD25 (Sutmuller, supra), anti-CD4 (Matsui, et al. J. Immunol., 1999, 163: 
184-193), the fusion protein IL13Ra2-Fc (Terabe, et al. Nature Immunol., 2000, 1: 515-520), 
and combinations thereof (i.e., anti-CTLA-4 and anti-CD25, Sutmuller, supra) have been shown 
to upregulat'e anti-tumor immune responses and would be suitable in practicing the present 

10 invention. Such treatments, among others,. may also be combined with the one or more 
immunogenic targets of the present invention. 

Any of these components may be used alone or in combination with other agents. For 
instance, it has been shown that a combination of CD80, ICAM-1 and LFA-3 ("TRICOM") may 
potentiate anti-cancer immune responses (Hodge, et al. Cancer Res. 59: 5800-5807 (1999).. 

15 Other effective combinations include, for example, IL-1 2 + GM-CSF (Ahlers, et al. J. Immunol, 
158:3947-3958 (1997); Iwasaki, et al. /.Immunol 158:4591-4601 (1997)), 1L-12 + GM-CSF + 
TNF-a (Ahlers, et al. Int. Immunol 13: 897-908 .(2001)), CD80 + IL-1 2 (Fruend, et al. Int. J. 
Cancer, 85: 508-517 (2000); Rao, et al. supra), and CD86 + GM-CSF + IL-12 (Iwasaki, supra). 
One of skill in the art would be aware of additional combinations useful in carrying out the 

20 present invention. In addition, the skilled artisan would be aware of additional reagents or 
methods that may be used to modulate such mechanisms. These reagents and methods, as well 
as others known by those of skill in the art, may be utilized in practicing the present invention. 

Additional strategies for improving the efficiency of nucleic acid-based immunization 
may also be used including, for example, the use of self-replicating viral replicons (Caley, et al. 

25 1999. Vaccine, 17: 3124-2135; Dubensky, et al. 2000. Mol Med. 6: 723-732; Leitner, et al. 
2000. Cancer Res. 60: 51-55), codon optimization (Liu, et al. 2000. Mol Tlier., 1: 497-500; 
Dubensky, supra; Huang,. et al. 2001. J. Virol 75: 4947-4951), in vivo electroporation (Widera, 
et al. 2000. /. Immunol 164: 4635-3640), incorporation of CpG stimulatory motifs 
(Gurunathan, et al. Ann. Rev. Immunol, 2000, 18: 927-974; Leitner, supra', Cho, et al. J. 

30 Immunol. 168(10):4907-13), sequences for targeting of the endocytic or Ubiquitin-processing 
pathways (Thomson, et al. 1998. 7. Virol 72: 2246-2252; Velders, et al. 2001. J. Immunol 
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1 66: 5366-5373), Marek's disease virus type 1 VP22 sequences (J. Virol. 76(6):2676-82, 2002), 
prime-boost regimens (Gurunathan, supra; Sullivan, et al. 2000. Nature, 408: 605-609; Hanke, 
et al. 1998. Vaccine, 16: 439-445; Amara, et al. 2001. Science, 292: 69-74), and the use of 
mucosal delivery vectors such as 5a/wo«e//fl. (Darji,et al. 1997. Cell, 91: 765-775;" Woo", et al. 
5 2001. Vaccine, 19: 2945-2954). Other methods are known in the art, some of which are 
described below. 

Chemotherapeutic agents, radiation, anti-angiogenic compounds, or other agents may 
also be utilized in treating and / or preventing cancer using immunogenic targets (Sebti, et al. 
Oncogene 2000 Dec 27;19(56):6566-73). For example, in treating metastatic melanoma, suitable 
10 chemotherapeutic regimens may include BELD (bleomycin, vindesine, lomustine, and 
deacarbazine; Young, ct al. 1985. Cancer, 55: 1879-81), BOLD (bleomycin, vincristine, 
lomustine, dacarbazine; Seigler, et al. 1980. Cancer, 46: 2346-8);- DD (dacarbazine, 
actinomycin ; Hochster, et al. . Cancer Treatment Reports, 69: 39-42), or POC (procarbazine, 
vincristine, lomustine; Carmo-Pereira, et al. 1984. Cancer Treatment Reports, 68: 1211-4) 
15 among others. Other suitable chemotherapeutic regimens may also be utilized. 

Many anti-angiogenic agents are known in the art and would be suitable for co- 
administration with the immunogenic target vaccines and/or chemotherapeutic regimens (see, for 
example, Timar, et al. 2001. Pathology Oncol. Res., 7(2): 85-94). Such agents include, for 
example, physiological agents such as growth factors (i.e., ANG-2, NKl,2,4 (HGF), 
,20 transforming growth factor beta (TGF-P)), cytokines (i.e., interferons such as IFN-a, -P..-Y. 
platelet factor 4 (PF-4), PR-39), proteases (i.e., cleaved AT-III, collagen . XVHI fragment 
(Endostatin)), HmwKallikrein-d5 plasmin fragment (Angiostatin), prothroinbin-Fl-2, TSP-l), 
protease inhibitors (i.e., tissue' inhibitor of metalloproteases such as TIMP-1, -2, or -3; maspin; 
plasminogen activator-inhibitors such as PAI-1; pigment epithelium derived factor (PEDF)), 
25 Tumstatin (available through ILEX, Inc.), antibody products (i.e., the collagen-binding 
antibodies HUIV26, HU177, XL313; anti-VEGF; anti-integrin. (i.e., Vitaxin, {Lxsys))), and 
glycosidases (i.e., heparinase-l, -III). "Chemical", or modified physiological agents known or 
believed to have anti-angiogenic potential include, for example, vinblastine, taxol, ketoconazole, 
thalidomide, dolestatii), combrestatin A, rapamycin (Guba, et al. 2002, Nature Med., 8: 128- 
30 1 35), CEP-7055 (available from Cephalon, Inc.), flavone acetic acid, Bay 12-9566 (Bayer 
Corp.), AG3340°(Agouron, inc.), CGS 27023A (Novartis), tetracylcine derivatives (i.e., COL-3 
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(Collagenix, Lie.)), Neovastat (Aeterna), BMS-275291 (Bristol-Myers Squibb), low dose 5-FU, 
low dose methotrexate (MTX), irsofladine, radicicol, cyclosporine, captopril, celecoxib, 

• D45152-sulphated polysaccharide, cationic protein (Protamine), cationic peptide- VEGF, 
Suramin (polysulphonated napthyl urea), compounds that interfere with the function or 

5 production of VEGF (i.e., SU5416 or SU6668 (Sugen), PTK787/ZK225S4 (Novartis)), 
Distamycin A, Angiozyme (ribozyme), isoflavinoids, staurosporine derivatives, genistein, 
EMD121974 (Merck KcgaA), tyrphostins, isoquinolones, retinoic acid, carboxyamidotriazole, 
TNP-470, octreotide, 2-methoxyestradiol, aminosterols (i.e., squalamine), glutathione analogues 
(i.e., N-acteyl-L-cysteine), combretastatin A-4 (Oxigene), Eph receptor blocking agents (Nature, 

10 414:933-938, 2001), Rh-Angiostatin, Rh-Endostatin (WO 01/93897), cydic-RGD peptide, 
accutin-disinlegrin, benzodiazepenes," humanized anti-avb3 Ab, Rh-PAl-2, amiloride, p- 
amidobenzamidine, anti-uPA ab, anti-uPAR Ab, L-phanylalanin-N-inethylamides (i.e.j 
, Batimistat, Marimastat), AG3340, and minocycline. Many other suitable agents are known in 
the art and would suffice in practicing the present invention. 

15 The present invention may also be utilized in combination with "non-traditional" 

methods of treating cancer. For example, it has recently been demonstrated that administration 
of certain anaerobic bacteria may assist in slowing tumor- growth. In one study, Clostridium 
novyi was modified to eliminate a toxin gene carried on a phage episome and administered to 

* mice with colorectal tumors (Dang, et al. RN.A.S. USA, 98(26): 15155-15160, 2001). In 
20 combination with chemotherapy, the treatment was shown to cause tumor necrosis in the 

animals. The reagents and methodologies described in. this application may be combined with 

such treatment methodologies. 

Nucleic acids encoding immunogenic targets may be administered to patients by any of 

several available techniques. Various viral vectors that have been successfully utilized for 
25' introducing a nucleic acid to a host include retrovirus, adenovirus, adeno-associated virus 

(AAV), herpes virus, and poxvirus, among others. It is understood in the art that many such viral 

vectors are available in the art. The vectors of the present invention may be constructed using 
. . standard recombinant techniques widely available to one skilled in the art. Such techniques may 

be found in common molecular biology references such as Molecular Cloning: A Laboratory 

30 Manual (Sambrook, et al.; 1989, Cold Spring Harbor . Laboratory Press), Gene Expression 

Technology (Methods in Enzymology, Vol. 185, edited by D. Goeddel, 199]. Academic Press, 
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San Diego., CA), and PCR Protocols: A Guide to Methods and Applications (Ihnis, etal. 1990. 
Academic Press, San Diego, CA). 

Preferred retroviral vectors are derivatives of lentivirus as well as derivatives of murine 
or avian retroviruses. Examples of suitable retroviral vectors include, for example, Moloney 
5 murine leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary 
tumor virus (MuMTV), SIV, BIV, HIV and Rous Sarcoma Virus (RSV). A number of retroviral 
vectors can incorporate multiple exogenous nucleic acid sequences. As recombinant retroviruses 
are defective, they require assistance in order to produce infectious vector particles. -This 
assistance can be provided by, for example, helper cell lines encoding retrovirus structural genes. 
10 Suitable helper cell lines include ¥2, PA317 and PA12, among others; The vector virions 
produced using' such cell lines may then be used to infect a tissue cell line, such as NIH 3T3 
cells, to produce large quantities of chimeric retroviral virions. Retroviral vectors may be 
administered by traditional methods (i.e., injection) or by implantation of a "producer cell line" 
in proximity to the target cell population (Culver, K., el al, .1994, Hum. Gene Titer., 5 (3): 343- 
15 79; Culver, K., et al, Cold Spring Barb. Symp. Quant Biol, 59: 685-90); Oldfield, E., 1993, 
Hum. Gene Ther., 4 (1): 39-69). The producer cell line is engineered to produce a viral vector 
and releases viral particles in the vicinity of the target cell. A portion of the released viral 
particles contact the target cells and infect those cells, thus delivering a nucleic acid of the 
present invention to the target cell. Following infection of the target cell, expression of the 
. 20 ' nucleic acid of the vector occurs. 

Adenoviral vectors have proven especially useful for gene transfer into eukaryotic cells 
(Rosenfeld, M., et at, 1991, Science, 252 (5004): 431-4; Crystal, R., et al, 1994, Nat Genet, 8 
(1): 42-51), the study eukaryotic gene expression (Levrero, ML, et al, 1991, Gem, 101 (2): 195- 
202), vaccine development (Graham, F. and Preyec, L., 1992, Biotechnology, 20: 363-90), and in 
25 animal models (Stratford-Perricaudet, L., et al. t 1992, Bone Marrow Transplant, 9 (Suppl. 1): 
151-2 ; Rich, D., et' al, 1993, Hum Gene Ther., 4 (4): 461-76). Experimental routes for 
administrating recombinant Ad to different tissues in vivo have included intratracheal instillation 
(Rosenfeld, M., et at, 1992, Cell, 68 (1): 143-55) injection into muscle (Quantin, B., et at, 1992, 
Proc. Natl. Acad. Set U.S.A., 89 (7): 2581-4), peripheral intravenous injection (Herz, J., and 
30 GeraTd, R., 1993, Proc. Natl. Acad. Set U.S.A., 90 (7): 2812-6) and stereotactic inoculation to 

brain (Le Gal La SaUe, G., et al, 1 993, Science, 259 (5097): 988-90), among others. 

20 



evSDOCtO: <WO 2005Q2fl37tt*2 I > 



WO 2005/025370 



PCT/US200^/02K751 



Adeno-associated virus (AAV) demonstrates high-level infectivity, broad host range and 
specificity in integrating into the host cell genome (Hermonat, P., et al., 1984, Proc.fifatl Acad. 
• Sci. U.S.A., 81 (20): 6466-70). And Herpes Simplex Virus type-1 (HSV-1) is yet another 
• attractive vector system, especially for use in the nervous system because of its neurotropic 
5 property (Geller, A., et al. t 1991, Trends Neurosci., 14 (10): 428-32; Glorioso, et al., 1995, Mol 
Biotechnol, 4 (1): S7-99; Glorioso, et al, 1995, Annw Rev. Microbiol, 49: 675-710). 

Poxvirus is another useful expression vector (Smith, et al. 1983, Gene, 25 (1): 21-8; 
Moss', et al, 1992, Biotechnology, 20: 345-62; Moss, et al, 1992, Curr. Top. Microbiol Immunol., 
• 158:. 25-38; Moss, et at. 1991. Science, 252: 1662-1667).. Poxviruses shown to be useful 
10 include vaccinia, NYVAC, avipox, fowlpox,. canarypox, ALVAC, and ALVAC(2), among 
others. 

NYVAC (vP866) was derived from the Copenhagen vaccine strain of vaccinia virus by 
deleting six nonessential regions of the genome encoding known or potential virulence factors 
(see, for example, U.S. Pat. Nos. 5,364,773 and 5,494,807). The deletion loci were also 

15 engineered as recipient loci for the insertion of foreign genes. The deleted regions are: 
thymidine kinase gene (TK; J2R); hemorrhagic region .(u; B13R+B14R); A type inclusion body 
region (ATI; A26L); hemagglutinin gene (HA; A56R); host range gene region (C7L-K1L); and, 
large subunit, ribonucleotide reductase (I4L). NYVAC is a genetically engineered vaccinia virus 
strain that was generated by the specific deletion of eighteen open reading frames encoding gene 

20 products associated with virulence and host. range. NYVAC has been show to be useful for 
expressing TAs (see, for example, U.S. Pat. No. 6,265,189). NYVAC (yPS66), vP994, vCP205, 
vCP1433, placZH6H4Lreverse, pMPC6H6K3E3 and pC3H6FHVB were also deposited with the 
ATCC under the terms of the Budapest Treaty, accession numbers VR-2559, VR-2558, VR- 
2557/VR-2556, ATCC-97913, ATCC-97912, and ATCC-97914, respectively. 

25 ALVAC-based recombinant viruses' (i.e., ALVAC-1 and ALVAC-2) are also suitable for 

use in practicing the present invention (see, for example, U.S. Pat. No. 5,756,103). ALVAC(2) 
is identical to ALVAC(l) except that ALVAC(2) genome comprises the vaccinia E3L and K3L 
genes under the control of vaccinia promoters (U.S. Pat. No. 6,330,066; Beattie et al., 1995a, 
. 1995b, 1991; Chang et al., 1992; Davies et al, 1993). Both ALVAC(l) and ALVAC(2) have 

30 been demonstrated to be useful in expressing foreign DNA sequences, such as TAs (Tartaglia et 
al., 1993 a,b; U.S. Pat. No. 5,833,975). ALVAC was deposited under the terms of the Budapest 
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Treaty with the American Type Culture Collection (ATCC), 10801 University Boulevard, 
Manassas, Va. 201 10-2209, USA, ATCC accession number VR-2547. 

Another useful poxvirus vector is TROVAC. TROVAC refers to an attenuated fowlpox 
that was a plaque-cloned isolate derived from the FP-1 vaccine strain of fowlpoxvirus which is 
5 licensed for vaccination of 1 day old chicks. TROVAC was likewise deposited under the terms 
of the Budapest Treaty with the ATCC, accession number 2553. 

"Non-viral" plasmid vectors may also be suitable in practicing the present invention. 
Preferred plasmid vectors are compatible with bacteriai, insect, and / or mammalian host cells. 
Such vectors include, for example, PCR-II, P CR3, and pcDNA3.l (lnvitrogen, San Diego, CA), 
]0 pBSII (Stratagene, La Jolla, CA), pETl 5 (Novagen, Madison,' WI), pGEX (Pharmacia Biotech, 
' Piscataway, NJ), pEGFP-N2 (Clontech, Palo Alto, CA), pETL (BlueBacII, lnvitrogen), pDSR- 
alpha (PCT pub. No. WO 90/1 4363) and pFastBacDual (Gibco-BRL, Grand Island, NY) as well 
as Bluescript 81 plasmid derivatives (a high copy number COLE1 -based phagemid, Stratagene 
Cloning Systems, La Jolla, CA), PCR cloning plasmids designed for cloning Taq-amplified PCR 
l s products (e.g. , TOPO™ TA cloning® kit, PCR2. 1® plasmid derivatives, Invinogen, Carlsbad, 
CA). Bacterial vectors may also be used with the current invention. These vectors include, for 
example, Shigella, Salmonella, Vibrio cholerae, Lactobacillus. Bacille calmette guerin (BCG). 
and.Streptococcus (see for example, WO 88/6626; WO 90/0594; WO 91/13157; WO 92/1796; 
and WO 92/21 376). Many other non-viral plasmid expression vectors and systems are known in 
20 the art and could be used with the current invention. 

Suitable nucleic acid delivery techniques include DNA-ligand complexes, adenovirus- 
' ligand-DNA complexes, direct injection of DNA, CaP0 4 precipitation, gene gun technfques, 
electroporation, and colloidal dispersion systems, among others. Colloidal dispersion systems 
include macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based systems 
25 including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The preferred 
colloidal system of this invention is a liposome, which are artificial membrane vesicles useful as 
delivery vehicles in vitro and in vivo. RNA, DNA and intact virions can be encapsulated within 
■ the aqueous interior and be delivered to cells in a biologically active form (FraJey, R., et ai, 
1981, Trends Biochem. Sci., 6: 77). The composition of the liposome is usually a combination 
30 of phospholipids, particularly high-phase-transition-temperature phospholipids, usually in 
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combination with steroids, especially cholesterol. Other phospholipids or other lipids may also, 
be used. The physical characteristics of. liposomes depend on pH, ionic strength, and the 
presence of divalent cations. Examples of lipids useful in liposome production include 
phosphatidyl compounds, such as phosphatidylglycerol, phosphatidylcholine, 

5 phosphatidylserine, phosphatidylethanolamine, sphingolipids, cerebrosides, and gangliosides. 
Particularly useful are diacylphosphatidylglycerols, where the lipid moiety contains from 14-18 
carbon atoms, particularly from 16-18 carbon atoms, and is saturated. Illustrative phospholipids 
include egg phosphatidylcholine, dipalmitoylphosphatidylcholine and 
distearoylphosphatidylcholine. 

10 An immunogenic target may also be administered in combination with one or more 

adjuvants to boost the immune response;. Exemplary adjuvants are shown in Table II below: 

Tabic II 

Types of Immunologic A djuvants i 



Type of 
Adjuvant 


General Examples 


Specific Examples/References 


Gel-type 


Aluminum hydroxide/phosphate ("alum 
adjuvants") 


(Aggerbeck and Heron, 1995) 


Calcium phosphate 


(Rely veld, 1986) 


Microbial 


Muramy! di peptide (MDP) 


(Chcdid et bI. v 1986) 


Bacterial exotoxins 


Cholera toxin (CT), E.coli labile toxin 
(LT)(Frey tag and Clements, 1999) 


Endotox in-based adjuvants 


Monophosphoryl lipid A (MPL) 
(Ulrich and Myers, 1995) 


Other bacterial' " 


CpG oligonucleotides (Corral and 
Petray, 2000), BCG sequences (Krieg, et 
a!. Nature, 374:576), tetanustoxoid 
(Rice, et al. /. Immunol.' , 2001, 1 67: 
1558-1565) 


Particulate 


Biodegradable 
Polymer microspheres 


(Gupta etal., 1998) 


Irnmunoslimulatory complexes 
(ISCOMs) 


(Morein and Bengtsson, 1999) ; 


Liposomes 


(Wassefetal., 1994) 


Oil-emulsion 
and 

surfactant- 
based 
adjuvants 


Freund's incomplete odjuvant 


(Jensen etal. f 1998) 


Micro flu idized emulsions 


MF59 (Ott et al M 1995) 


SAF (Allison and Byars, 1992) 
(Allison, 1999) 


Saponins 


QS-21 (Kensil, 1996) 


Synthetic 


Muramyl peptide derivatives 


Murabutide (Ledercr, 1986) 
Threony-MDP (Allison, 1997) 


Noruonic block copolymers 


L121 (Allison, 1999) 


Polyphosphazene (PCPP) 


(Payne etal., 1995) 
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i Synthetic polynucleotides 


PoW A:U, Polv l:C (Johnson, 1994) 




Thalidomide derivatives 


CC-4047/ACTIMID (J. Immunol, 
168(10*4914-9} 



' Administration of a composition of the present invention to a host may be accomplished 
' using any of a variety of techniques known to those of skill in the art. The coraposition(s) may 
be processed in accordance with conventional methods of pharmacy to produce medicinal agents 
5 for administration to patients, including humans and other mammals (i.e., a "pharmaceutical 
composition"). The pharmaceutical composition is preferably made in the form of a dosage unit 
containing a- given amount of DNA, viral vector particles, polypeptide or peptide, for example. A 
suitable daily dose for a human or other mammal may vary widely depending on the condition of 

• the patient and other factors, but, once again, can be determined using routine methods. 

10 • The pharmaceutical composition may be administered orally, parentally, by inhalation 

' spray, rectally, intranodal!^ or topically in dosage unit formulations containing conventional 
phaimaceutically acceptable earners, adjuvants, and vehicles. The term "pbarmaceutically 
acceptable carrier" or "physiologically acceptable carrier" as used herein refers to one or more 
formulation materials suitable for accomplishing or enhancing the delivery of. a nucleic acid, 

15 polypeptide, or peptide as a pharmaceutical composition. A "pharmaceutical composition" is a 
composition comprising a therapeutically effective amount of a nucleic acid or polypeptide. The 
terms "effective amount" and "therapeutically effective amount" each refer to the amount of a 

• nucleic acid or polypeptide used to induce, or enhance an effective immune response. It is 
preferred that compositions of the present invention provide for the induction or enhancement of 

20 ' an anti-tumor immune response in a host which protects the host from the development of a 
tumor and / or allows the host to eliminate an existing tumor from the body. 
. . For oral administration, the pharmaceutical composition may be of any of several forms 
including, for example, a capsule, a tablet, a suspension, or liquid, among others. Liquids may 
' ' be administered by injection as a composition with suitable .carriers including saline, dextrose, or 
25 water. The term parenteral as used herein includes subcutaneous, intravenous, intramuscular, 
' intrastemal, infusion; or intraperitoneal administration. Suppositories for rectal administration of 
the drug can' be prepared by mixing the drug with a suitable non-irritating excipient such as 
• ' cocoa butter and polyethyiene glycols that are solid at ordinary temperatures but liquid at the 
rectal temperature. 
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The dosage regimen for immunizing a host or otherwise treating a disorder or a disease 
with a composition of this invention is based on a variety of factors, including the type of 
disease, the age, weight, sex, medical condition of the patient, the severity of the condition, the 
route of administration, and the particular compound employed. For example, a poxviral vector 
5 may he administered as a composition comprising 1 x 10 6 infectious particles per dose. Thus, 
the dosage regimen may vary widely, but can be determined routinely using standard methods. 

A prime-boost regimen may also be utilized (WO 01/30382 Al) in which the targeted, 
immunogen is initially administered in a priming step in one form followed by a boosting step in 
which the targeted immunogen is administered in another form. The fonn of the targeted 
]0 * immunogen in the priming and boosting steps are different. For instance, if the priming step 
utilized a nucleic acid, the boost may be administered as a peptide. Similarly,'Svh'6re 'a priming 
step utilized one type of recombinant virus (i.e., ALVAC), the boost step may utilize another 
type of vims (i.e., NYVAC). This prime-boost method of administration has been shown to 
induce strong immunological responses. Various combinations of forms are suitable in 
.15 practicing the present invention. 

While the compositions of the invention can be administered as the sole active 
pharmaceutical agent, they can also'be used in combination with one or more other compositions 
or agents (i.e., other immunogenic targets, co-stimulatory molecules, adjuvants). When 
administered as a combination, the individual components can be formulated as separate 
•20 compositions administered at the same time or different times, or the components' can be 
combined as a single composition. 

Injectable preparations, such as sterile injectable aqueous or oleaginous suspensions, may 
be formulated according to lenown methods using, suitable dispersing or wetting agents and 
suspending agents. The injectable preparation may also be a sterile injectable solution or 
• 25 suspension in a non-toxic parenterally acceptable diluent or solvent.. Suitable vehicles and 
. solvents that may be employed are water, Ringer's solution, and isotonic sodium chloride 
solution, among others. For instance, a viral vector such as a poxvirus may be prepared in 0.4% 
NaCl. In addition, sterile, fixed oils are conventionally employed as a solvent or suspending 
medium. For this purpose, any bland fixed oil may be employed, including synthetic mono- or 
30 diglycerides. In addition, fatty acids such as oleic acid find' use in the preparation of injectables. 
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For topical administration, a suitable topical dose of a composition may be administered 
one to four, and preferably two or three times daily. The dose may also be administered with 
intervening days during which no does is applied. Suitable compositions may comprise from 
0.001% to 10% w/w, for example, from 1% to* 2% by weight of the formulation, although it may 
. 5 comprise as much as 10% w/w, but preferably not more than 5% w/w, and more preferably from 
0.1 % to 1% of the formulation. Formulations suitable for topical administration include liquid or 
semi-liquid preparations suitable for penetration through the skin (e.g., liniments, lotions, 
ointments, creams, or pastes) and drops suitable for administration to the eye, ear, or nose. 

The pharmaceutical compositions may also be prepared in a solid form (including 
10 granules, powders or suppositories). The pharmaceutical compositions may be subjected to 
conventional pharmaceutical operations such as sterilization and/or may contain conventional 
adjuvants, such as preservatives, stabilizers, wetting agents, emulsifiers, buffers etc. Solid 
dosage forms for oral administration may include capsules, tablets, pills, powders,.and granules. 
• In such solid dosage forms, the active compound may be admixed with at least one inert diluent 
!5 such as sucrose/lactose, or starch. Such dosage foiros may also comprise, as in normal practice, 
additional substances other than inert diluents, e.g., lubricating agents such as magnesium 
' stearalc, In the case of capsules, tablets, and pills, the dosage forms may also comprise buffering 
agents. Tablets and pills can additionally be prepared with enteric coatings. Liquid dosage, 
forms for oral administration may include pharmaceutical^ acceptable emulsions, solutions, 
20 suspensions, syiups, and elixirs containing inert diluents commonly used in the art, such as 
water. Such compositions may also comprise adjuvants, such as wetting sweetening, flavoring, 
and perfuming agents. 

Pharmaceutical compositions comprising a nucleic acid or polypeptide of the present 
invention may take any of several forms and may be administered by any of several routes. In 

25 . preferred embodiments, the compositions are administered via a parenteral route (intradermal, 
intramuscular or subcutaneous) to induce an immune response in the host. Alternatively, the 
composition may be administered directly into a lymph node (intranodal) or tumor mass (i.e., 
intratumoral administration). For example, the dose could be administered subcutaneously at 
days 0, 7, and 14. Suitable methods for immunization using compositions comprising TAs are 

30 known in the art, as shown for pS3 (Hollsteiri et al. f 1991), p21-ras (Almoguera et al., 1988), 

HER-2 (Fendly et.al., 1990), the melanoma-associated antigens (MAGE-1; MAGE-2)- (van der 
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Bruggen et al., 1993), p97 (Hu et al, 1988), melanoma-associated antigen E.(WO 99/30737) and 
carcinoerabryonic antigen (CEA) (Kantor et a!., 1993; Fishbein et al., 1992; Kaufinan et al., 
1991), among others. 

Preferred embodiments of administratable compositions include, for example, nucleic 
acids or polypeptides in liquid preparations such as suspensions, syrups, or elixirs.' " Preferred 
injectable preparations include, for example, nucleic acids or polypeptides suitable for parental, 
subcutaneous, intradermal, intramuscular or intravenous administration such as sterile 
suspensions or emulsions. For example, a recombinant poxvirus may be in admixture with a 
suitable carrier, diluent, or excipient such as sterile water, physiological saline, glucose or the 
like. The composition may also be provided in lyophiJized form for reconstituting, for instance, 
in isotonic aqueous, saline buffer. Tn addition, the compositions can be co-administered or 
sequentially administered with other antineoplastic, anti-tumor or anti-cancer agents and/or with 
agents which reduce or alleviate ill effects of antineoplastic, anti-tumor or anti-cancer agents. 

A kit comprising a composition of the present invention is also provided. The kit caii 
include a separate container containing a suitable carrier, diluent or excipient The kit can also 
include an additional. anti-cancer, anti-tumor or antineoplastic agent and/or an agent that reduces 
or alleviates ill effects of antineoplastic, anti-tumor or anti-cancer agents for co- or sequential-, 
administration. Additionally, the kit can include instructions for mixing or combining ingredients 
and/or administration. 

A better understanding of the present invention and of its many advantages will be had 
from the following examples, given by way of illustration. 

EXAMPLES 
Example 1 

Construction of the Multi-Antigen Construct vT416 
The .expression vector vT416 (ALVAC-NY-ESO-l/Trp-2-LFA-3/ICAM-l/B7.1- 
E3L/K3L) was constructed in the ALVAC vector using standard techniques. DNA sequences 
encoding NY-ESO-1, Trp-2, LFA-3, ICAM-1, B7.1, vvE3L and wK3L were inserted into 
various loci within the ALVAC genome. DNA sequences encoding NY-ESO-1 (Chen et al. 

1997 PNAS 94:1914) and TRP-2 (Wang et al. 1996 J. Exp. Med. 184:2207) were inserted into 
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the C5 locus. DMA sequences encoding LFA-3 (Wallner, et'aL (1987) J. Exp. Med.i66:923- 
932), ICAM-1 (Staunton, et al. (1988) Cell 52:925-933) and B7.1 (Chen, et al. (1992) .Cell 
71:1093-1102) were inserted into the C3 locus. LFA-3, ICAM-1 and B7.1 form an expression 
cassette known as TRICOM. UNA sequences encoding vvE3L (Chang, et al. 1992. Proc. Natl. 
5 Acad. Sci. U. S. A 89:4825-4829) and wK3L (Beattie, et al. 1991. Virology 183:419-422) were 
inserted into the C6 locus. Promoters were utilized as follows: 



Table ni 



DNA sequence 


Promoter 


E3L 


vaccinia E3L 


K3L 


vaccinia H6 


LFA-3 1 


vaccinia 30K 


ICAM-1 


vaccinia 13 


B7.1 


sE/L 


NY-ESO-1 


vaccinia H6 


, TRP-2 


sE/L 



10 

Promoter sE/L is described by Chakrabarti, et al. (BioTechniques 23: 1094-1097, 1997). 

The donor plasmids utilized are shown below: 
15 . 

TablcIV 



Plasmid 


Size (bp) 


Vector 


Antibiotic 
Resitance Gene 


PMPC6H6K3E3 




pBS-SK 


Amp . 


pALVAC.Tricom(C3) #33 


10,470 


pBS-SK 


Amp 


pT1132 


11,154 


pBS-SK 


Amp 



NY-ESO-1 and TRP-2 DNA sequences were inserted into the ALVAC donor plasmid 
20 pT1132. This donor plasmid was then used with pALVAC.Tricom(C3) #33 .to generate the 
ALVAC-TRICOM recombinant expressing these genes using standard techniques. The plasmids 
pALVAC.Tricom(C3) #33 and pT1132 are shown in Figure 1. The DNA sequences of 
pALVAC.Tricom(C3) #33 and pT1132 are shown in Figures 2 and 3, respectively. 
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Example 2 

Construction of the Multi-Antigen Construct vT419 
The expression vector vT419 (ALVAC-gplOOM/Mart-1/ Mage-1,3 minigene-LFA- 
3/ICAM-1/B7.1-E3L/K3L) was constructed in the ALVAC vector using standard techniques. 
DNA sequences encoding the gplOOM/MART-l/MAGE-1,3 minigene, LFA-3, ICAM-1, B7.1, 
wE3L and wK3L were inserted into various loci within the ALVAC genome. The gplOOM/ 
MART- 1 /MAGE- 1 ,3 mini gene was inserted into the C5 locus. DNA sequences encoding LFA-3 
(Wallner, et al. (1987) J. Exp. Med. 166:923-932), ICAM-1 (Staunton, et al. (1988) Cell 
52:925-933) and B7.1 (Chen, et al. (1992) Cell 71:1093-1102) were inserted into the C3 locus. 
LFA-3, ICAM-1 and B7.1 form an expression cassette known as TRJCOM. DNA sequences 
encoding vvE3L (Chang, et al. 1992. Proc: Natl. Acad. Sci. U. S. A 89:4825-4829) and wK3L 
(Beattie, et al. 1991. Virology 183:419-422) were inserted into the C6 locus. Promoters were 
utilized as follows: 



Table V 



Gene 


Promoter 


E3L 


vaccinia E3L 


K3L 


vaccinia H6 


LFA-3 


vaccinia 30K 


ICAM-1 


vaccinia 13 


• B7.1 


sE/L 


gpl00(M) 


vaccinia H6 


Mart-i 


vaccinia 42K 



Promoter sE/L is described by Chakrabarti, et al. (BioTechniques 23: 1094-1097, 1997). 
The donor plasmids utilized are shown below: 



Table VI 



Plasniid 


Size (bp) 


Vector 


Antibiotic 
Resitance Gene 


• PMPC6H6K3E3 




pBS-SK 


Amp 


pALVACTricom(C3) #33 


10,470 


pBS-SK 


Amp 


PT3217 


11,465 


pBS-SK 


Amp 
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. gplO0(M), Mart-1 and Mage- 1,3 minigene were inserted into the ALVAC C5 donor 
plasmid pT3217. This donor plasmid was then used with pALVAC.Tricom(C3) #33 to generate 
the ALVAC-TRJCOM recombinant expressing these genes using standard techniques. This 
donor plasmid inserts into the C5 site. pALVAC.Tricom(C3) #33 is shown in Figures l and 2. 
5 The pT3217 plasmid is shown in Figure 4. The DNA sequence of pT3217 is shown in Figure 5. 

EXAMPLE 3 
Immunological Assessment of Multi-Antigen Vectors 
The results of the first animal experiment indicated a trend toward higher immunological 
10 • responses to three (Mart 1, NY-ESO-1 and gplOO) of the four antigens when the vaccine was 
given as two separate injections. However, these"differences were not statistically significant In 
detail, HLA-A2/K b transgenic mice (5/group) were immunized subcutaneously with vT419 
(ALVAC(2)-gpl00M/MART-l/MAGE-l/3 minigeneTfRICOM) and vT416 (ALVAC(2)-TRP- 
2/NY-ESO-l/TRICOM) either combined at one site or given as separate injections. Control mice 
15 were immunized with parental ALVAC(2). Mice were vaccinated three times (at three week 
intervals), and three weeks after the last boost T cell responses in individual mice were analyzed 
by IFN-g ELISPOT and CTL assays following in vitro restimulation with peptide. Compared to 
control animals, mice vaccinated with the multi-antigen vectors (at 2 sites) exhibited statistically 
significant ELISPOT responses against MART-1. The IFN-gamma response to gp lOOM and NY- 
20 ESO-1 were also detectable, although these responses were not statistically significant due to 
response variability and the small number of cultures tested. ELISPOT responses against the 
TRP-2 antigen were elevated in all groups tested (including control animals), presumably due to 
the fact that the dominant A2-restricted TRP-2 peptide (180-188) cross-reacts with H-2K b and 
can induce low avidity T cell responses in naive mice following in vitro, culture, and were 
25 therefore not statistically significant. Interestingly, ELISPOT responses in mice injected with an 
admixture of vT;416 and vT419 were generally lower than in mice receiving each virus 
separately, although these differences did not achieve statistical significance. The CTL data were 
largely negative, except for one strong anti-gp 100 response and one marginal anti-MART-1 
response, both of which occurred in mice vaccinated with vT416 and vT419 (two sites). Overall, 
30 these results provided encouraging data that establish that the multi-antigen vectors can generate 
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responses against MART- 1, and suggest that anti-gplOO and anti-NY-ESO-1 responses can also 
be induced. 

~ Two additional pre-clinical animal studies have been completed using the melanoma 
multi-antigen ALVAC recombinants. In these experiments, HLA-A27K b transgenic mice 
(5/group) were immunized subcutaneously with vT4l9 (ALVAC(2)-gpl00M/MART-l/MAGE- 
1/3 minigenemUCOM) and vT416 (ALVAC(2)-TRP-2/NY-ESO-l/TRJCOM) either combined 
at one site or given as separate injections. Control mice were immunized with parental 
ALVAC(2). After vaccination, the T cell responses in individual mice were assessed by IFN- 
.gamma ELISPOT assay following in vitro restimulatioh with peptide. Unlike the previous multi- 
antigen experiment, which provided encouraging immunogenicity data, the two most recent 
studies generated inconclusive data, due to high background responses in control immunized 
animals. Therefore, overall the results were deemed as inconclusive. 

To confirm the immunogenicity of the multi-antigen constructs, and to repeat results 
iiom the first study, another pre-clinical animal study has been completed.- HLA-A2/K b 
transgenic mice (10/group) were immunized subcutaneously with vT419 • (ALVAC(2)- 
gpl00M/MART-I/MAGE-l/3 minigene/TRJCOM) and vT416 (ALVAC(2)-TRP-2/NY-ESO- 
imUCOM) given as. separate injections. Control mice were immunized with parental 
ALVAC(2). Statistically significant ELISPOT responses were detectable against gplOO, Mart-1 
and TRP-2, and some responses were detected against NY-ESO-1, which were at the border of 
being statistically significant. 

While the present invention has been described in terms of the preferred embodiments, "it 
is understood that variations and modifications will occur to those skilled in the art' Therefore, it 
is intended that the appended claims cover all such equivalent variations that come within the 
scope of the invention as claimed. 
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CLAIMS 

What is claimed is: 

1. An expression vector for co-expressing at least two immunogenic targets, wherein said 
immunogenic targets are selected from the group consisting of NY-ESO-1, TRP-2, gplOO, 

5 gplOOM, a MART antigen, MART-1, a MAGE antigen, MAGE-1, and MAGE-3. ' 

2. The expression vector of claim 1 wherein the vector is a plasmid or a viral vector. 

3. The expression vector of claim 2 wherein the viral vector is selected from the group 
consisting of poxvirus, adenovirus, retrovirus, herpesvirus, and adeno-associated virus. • . 

4. The expression vector of claim 3 wherein the viral vector is a poxvirus .selected from the 
10 * group consisting of vaccinia, NYVAC, avipox, canarypox, ALVAC, ALVAC(2), fowlpox, 

andTROVAC. 

5. The expression vector of claim 4 wherein the viral vector is a poxvirus selected from the 
group consisting of NYVAC, ALVAC, and ALVAC(2). 

6. The expression vector of claim ] further comprising at least one nucleic sequence encoding 
.15 an angiogenesis-associated antigen. 

7. The expression vector of claim 6 wherein the vector is a plasmid or a viral vector. 

8. The expression vector of claim 7 wherein die viral vector is selected from the group 
consisting of poxvirus, adenovirus, retrovirus, herpesvirus, and adeno-associated virus. 

9.. The expression vector of claim 8 wherein the viral vector is a poxvirus selected from the 
20 group consisting of vaccinia, NYVAC, ayipox, canarypox, . ALVAC, ALVAC(2), fowlpox, . 
and TROVAC. 

10. The expression vector of claim 9 wherein the viral vector is a poxvirus selected from the 
group consisting of NYVAC, ALVAC, and ALVAC(2). 

1 1 . The expression vector of claim 1 or 6 further comprising at least one nucleic acid sequence 
25 encoding a co-stimulatory component. 

J 2. The expression vector of claim 1 1 wherein the vector is a plasmid or a viral vector. 

13. The expression vector of claim 12 wherein the viral vector is selected from the group 
consisting of poxvirus, adenovirus, retrovirus, herpesvirus, and adeno-associated virus. 

14. The expression vector of claim 13 wherein the viral vector is a poxvirus selected from the 
30 group consisting of vaccinia, NYVAC, avipox, canarypox, ALVAC, ALVAC(2), fowlpox, 

andTROVAC. 
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1 5. The expression vector of claim 14 wherein the viral vector is a poxvirus selected from the 
group consisting of NYVAC, ALVAC, and ALVAC(2). . 

16 The expression vector of any one claims 11-15 wherein the co-stimulatory component is 
human B7.1. ' N 

17. A composition comprising an expression vector of any one of claims 1-16 in a 
pharmaceutical^ acceptable carrier. 

1 8. A method for preventing or treating cancer comprising administering to a host an expression 
vector of any one of claims 1-16. 

.9. A method for preventing or treating cancer comprising administering to a host a composition 
of claim 17. * ■ 
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FIGURE 1 




1/53 



BKSDOCtO; <WO 2005O2C370A2 I > 



WO 2005/026370 



PCT/US2004/028751 



FIGURE 2 

DNA Sequence of pALVAGTriconi(C3) #33 

GGAAATTGTA AACGTTAATA TTTTGTTAAA ATTCGCGTTA AATTTTTGTT 
CCTTTAACAT TTGCAATTAT AAAACAATTT TAAGCGCAAT TTAAAAACAA 
AAATCAGCTC ATTTTTTAAC CAATAGGCCG AAATCGGCAA AATCCCTTAT 
TTTAGTCGAG TAAAAAATTG GTTATCCGGC TTTAGCCGTT TTAGGGAATA 
AAATCAAAAG AATAGACCGA GATAGGGTTG AGTGTTGTTC CAGTTTGGAA 
TTTAGTTTTC TTATCTGGCT CTATCCCAAC TCACAACAAG GTCAAACCTT 
CAAGAGTCCA CTATTAAAGA ACGTGGACTC CAACGTCAAA GGGCGAAAAA 
GTTCTCAGGT GATAATTTCT TGCACCTGAG GTTGCAGTTT CCCGCTTTTT 
CCGTCTATCA GGGCGATGGC CCACTACGTG AACCATCACC CTAATCAAGT 
GGCAGATAGT CCCGCTACCG GGTGATGCAC TTGGTAGTGG GATTAGTTCA 
TTTTTGGGGT CGAGGTGCCG TAAAGCACTA AATCGGAACC CTAAAGGGAG 
AAAAACCCCA GCTCCACGGC ATTTCGTGAT TTAGCCTTGG GATTTCCCTC 
CCCCCGATTT AGAGCTTGAC GGGGAAAGCC GGCGAACGTG GCGAGAAAGG 
GGGGGCTAAA TCTCGAACTG CCCCTTTCGG CCGCTTGCAC CGCTCTTTCC 
AAGGGAAGAA AGCGAAAGGA GCGGGCGCTA GGGCGCTGGC AAGTGTAGCG 
TTCCCTTCTT TCGCTTTCCT CGCCCGCGAT CCCGCGACCG TTCACATCGC 
GTCACGCTGC GCGTAACCAC CACACCCGCC GCGCTTAATG CGCCGCTACA 
CAGTGCGACG CGCATTGGTG GTGTGGGCGG CGCGAATTAC GCGGCGATGT 
GGGCGCGTCG CGCCATTCGC CATTCAGGCT GCGCAACTGT TGGGAAGGGC 
CCCGCGCAGC GCGGTAAGCG GTAAGTCCGA CGCGTTGACA ACCCTTCCCG 
GATCGGTGCG GGCCTCTTCG CTATTACGCC, AGCTGGCGAA AGGGGGATGT 
CTAGCCACGC .CCGGAGAAGC GATAATGCGG TCGACCGCTT TCCCCCTACA 
GCTGCAAGGC GATTAAGTTG GGTAACGCCA GGGTTTTCCC AGTCACGACG 
CGACGTTCCG CTAATTCAAC C.CATTGCGGT CCCAAAAGGG TCAGTGCTGC 
TTGTAAAACG ACGGCCAGTG AATTGTAATA CGACTCACTA TAGGGCGAAT 
AACATTTTGC TGCCGGTCAC TTAACATTAT GCTGAGTGAT ATCCCGCTTA 
TGGGTACCGC GGGCGCGTCG ACATGCATTG TTAGTTCTGT AGATCAGTAA 
ACCCATGGCG CCGGCGCAGC TGTACGTAAC AATCAAGACA TCTAGTCATT 

Left Arm • . 

701 CGTATAGCAT ACGAGTATAA TTATCGTAGG TAGTAGGTAT CCTAAAATAA 
GCATATCGTA TGCTCATATT AATAGCATCC ATCATCCATA GGATTTTATT 

Left Arm 

751 ATCTGATACA GATAATAACT TTGTAAATCA ATTCAGCAAT" TTCTCTATTA 
TAGACTATGT CTATTATTGA AACATTTAGT TAAGTCGTTA AAGAGATAAT 

Left Arm 

801 TCATGATAAT GATTAATACA CAGCGTGTCG TTATTTTTTG TTACGATAGT 
AGTACTATTA CTAATTATGT GTCGCACAGC AATAAAAAAC AATGCTATCA 

Left Arm 

851 ATTTCTAAAG TAAAGAGCAG GAATCCCTAG TATAATAGAA ATAATCCATA 
TAAAGATTTC ATTTCTCGTC CTTAGGGATC ATATTATCTT TATTAGGTAT 

Left Arm ' 

. .901 TGAAAAATAT. AGTAATGTAC ATATTTCTAA TGTTAACATA TTTATAGGTA 
ACTTTTTATA TCATTACATG TATAAAGATT ACAATTGTAT AAATATCCAT 

Left Arm 

951 AATCCAGGAA GGGTAATTTT TACATATCTA TATACGCTTA TTACAGTTAT 
TTAGGTCCTT CCCATTAAAA ATGTATAGAT ATATGCGAAT AATGTCAATA 



. 5 
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5 


1001 


Left Arm * 

taaaaatAta cttgcaaaca'tgttagaagt aaaaaagaaa gaactaattt 
atttttatat gaacgtttgt acaatcttca ttttttcttt cttgattaaa 




1 ftM 


Left Arm 

JAUAAAblbv* rTTACCAAAA TGCCAATGGA AATTACTTAG TATGTATATA 
ATGTTTCACG AAATGGTTTT ACGGTTACCT TTAATGAATC ATACATATAT 


10 




Left Arm 

ATGTATAAAG GTATGAATAT CACAAACAGC AAATCGGCTA TTCCCAAGTT 
TACATATTTC CATACTTATA GTGTTTGTCG TTTAGCCGAT AAGGGTTCAA 


i e 




Left Arm 




' 1151 


GAGAAACGGT ATAATAGATA TATTTCTAGA TACCATTAAT AACCTTATAA 
CTCTTTGCCA TATTATCTAT ATAAAGATCT ATGGTAATTA TTGGAATATT 


20 


1201 


Left Arm 

GCTTGACGTT TCCTATAATG CCTACTAAGA AAACTAGAAG ATACATACAT 
CGAACTGCAA AGGATATTAC GGATGATTCT TTTGATCTTC TATGTATGTA 


25 


1251 


Left Arm 

ACTAACGCCA TACGAGAGTA ACTACTCATC GTATAACTAC TGTTGCTAAC 
TGATTGCGGT ATGCTCTCAT TGATGAGTAG CATATTGATG ACAACGATTG 


30 


1301 


Left Arm 

AGTGACACTG ATGTTATAAC TCATCTTTGA TGTGGTATAA ATGTATAATA 
TCACTGTGAC TACAATATTG AGTAGAAACT ACACCATATT TACATATTAT 


j j 


1351 


Left Arm 

ACTATATTAC ACTGGTATTT TATTTCAGTT ATATACTATA TAGTATTAAA 
TGATATAATG TGACCATAAA ATAAAGTCAA TATATGATAT ATCATAATTT 


1401 


Left Arm 

AA.TTATATTT GTATAATTAT ATTATTATAT TCAGTGTAGA AAGTAAAATA 
TTAATATAAA CATATTAATA TAATAATATA AGTCACATCT TTCATTTTAT 


40 


1451 ■ 


Left Arm 

CTATAAATAT GTATCTCTTA TTTATAACTT ATTAGTAAAG TATGTACTAT 
GATATTTATA CATAGAGAAT AAATATTGAA TAATCATTTC ATACATGATA . 


45 


1501 


Left Arm 

TCAGTTATAT TGTTTTATAA AAGCTAAATG CTACTAGATT GATATAAATG 
AGTCAATATA ACAAAATATT TTCGATTTAC GATGATCTAA CTATATTTAC 


50 


1551 


Left Arm 

AATATGTAAT AAATTAGTAA TGTAGTATAC TAATATTAAC TCACATTTGA 
TTATACATTA TTTAATCATT ACATCATATG ATTATAATTG AGTGTAAACT 



Left Arm 

30K Pr. 



55 " 1601 CTAATTAGCT ATAAAAACCC TAAGGTAGGC GGCCGCACTA GAGGATTCGA 
GATTAATCGA TATTTTTGGG ATTCCATCCG CCGGCGTGAT CTCCTAAGCT 
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30K Pr. . 

1651 CAAACACCAA TAATTCCCTT CTCTTCATTC CGGACATTAA ATTGGCTATA 
GTTTGTGGTT ATTAAGGGAA GAGAAGTAAG GCCTGTAATT TAACCGATAT 
30K Pr. 

1701 GATAATAAAG ACATTGAGAT GTTACAGGCT CTGTTCAAAT ACGACATTAA 
CTATTATTTC TGTAACTCTA CAATGTCCGA GACAAGTTTA TGCTGTAATT. 
30K Pr, 



1751 TATCTATTCT GCTAATCTGG AAAATGTACT ATTGGATGAT GCCGAAATAG 
ATAGATAAGA CGATTAGACC TTTTACATGA TAACCTACTA CGGCTTTATC 
30K Pr. 

1801 CTAAAATGAT TATAGAAAAG CATGTTGAAT ACAAGTCTGA CTCCTATACA 
GATTTTACTA ATATCTTTTC GTACAACTTA TGTTCAGACT GAGGATATGT 
30K Pr. 

1851 AAAGATCTCG ATATAGTCAA GAATAATAAA TTGGATGAAA TAATTAGCAA 
TTTCTAGAGC TATATCAGTT CTTATTATTT AACCTACTTT' ATTAATCGTT 
30K Pr. 



1901 AAACAAGGAA CTCAGACTCA TGTACGTCAA TTGTGTAAAG AAAAACTAAT 
TTTGTTCCTT GAGTCTGAGT ACATGCAGTT AACACATTTC TTTTTGATTA 
30K Pr. 

1951 TAGATTCTCC CACATTTTTG TTAACATTAC ACTMCTMT^TGGTAAAATT 
• ATCTAAGAGG GTGTAAAAAC AATTGTAATG TGATTGATTA ACCATTTTAA 

30K Pr. 

2001 • GATAGAATAA TTATGTGTAT ATAAGATAGA TTTCCTATTG TCTTACTCAT 
CTATCTTATT AATACACATA TATTCTATCT AAAGGATAAC AGAATGAGTA 
30K Pr. 



hLFA-3 % 

2051 TGCATCGTGG GAATTCAGAT CAGCTTCCGC GGCATGGTTG CTGGGAGCGA 
ACGTAGCACC CTTAAGTCTA GTCGAAGGCG CCGTACCAAC GACCCTCGCT 
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hLFA-3 



2101 CGCGGGGCGG GCCCTGGGGG TCCTCAGCGT GGTCTGCCTG CTGCACTGCT 
GCGCCCCGCC CGGGACCCCC AGGAGTCGCA CCAGACGGAC GACGTGACGA " 

hLFA-3 



2151 TTGGTTTCAT CAGCTGTTTT TCCCAACAAA TATATGGTGT TGTGTATGGG 
AACCAAAGTA GTCGACAAAA AGGGTTGTTT ATATACCACA ACACATACCC 

hLFA-3 • 



2201 AATGTAACTT TCCATGTACC AAGCAATGTG CCTTTAAAAG AGGTCCTATG 
TTACATTGAA AGGTACATGG TTCGTTACAC GGAAATTTTC TCCAGGATAC 

hLFA-3 



2251 GAAAAAACAA AAGGATAAAG TTGCAGAACT GGAAAATTCT GAATTCAGAG 
CTTTTTTGTT TTCCTATTTC AACGTCTTGA CCTTTTAAGA CTTAAGTCTC • 

hLFA-3 



2301 CTTTCTCATC TTTTAAAAAT AGGGTTTATT TAGACACTGT GTCAGGTAGC 
GAAAGAGTAG AAAATTTTTA TCCCAAATAA ATCTGTGACA CAGTCCATCG 

hLFA-3 



2351 CTCACTATCT ACAACTTAAC ATCATCAGAT GAAGATGAGT ATGAAATGGA 
GAGTGATAGA TGTTGAATTG TAGTAGTCTA CTTCTACTCA TACTTTACCT 

hLFA-3 

24 01 ATCGCCAAAT ATTACTGATA CCATGAAGTT CTTTCTTTAT GTGCTTGAGT 
TApCGGTTTA TAATGACTAT GGTACTTCAA GAAAGAAATA CACGAACTCA 

hLFA-3 



24 51 CTCTTCCATC TCCCACACTA ACTTGTGCAT TGACTAATGG AAGCATTGAA 
GAGAAGGTAG AGGGTGTGAT TGAACACGTA ACTGATTACC TTCGTAACTT 

hLFA-3 



2501 GTCCAATGCA TGATACCAGA GCATTACAAC AGCCATCGAG GACTTATAAT 
CAGGTTACGT ACTATGGTCT CGTAATGTTG TCGGTAdCTC CTGAATATTA 

hLFA-3 



2551 GTACTCATGG GATTGTCCTA TGGAGCAATG TAAACGTAAC TCAACCAGTA 
CATGAGTACC CTAACAGGAT ACCTCGTTAC ATTTGCATTG AGTTGGTCAT 

hLFA-3 



2601 TATATTTTAA GATGGAAAAT GATCTTCCAC AAAAAATACA GTGTACTCTT 
ATATAAAATT CTACCTTTTA CTAGAAGGTG TTTTTTATGT CACATGAGAA ' 

hLFA-3 



2651 AGCAATCCAT TATTTAATAC AACATCATCA ATCATTTTGA CAACCTGTAT 
-TCGTTAGGTA ATAAATTATG TTGTAGTAGT TAGTAAAACT GTTGGACATA 

hLFA-3 



2701 CCCAAGCAGC GGTCATTCAA GACACAGATA TGCACTTATA CCCATACCAT 
GGGTTCGTCG CCAGTAAGTT CTGTGTCTAT ACGTGAATAT GGGTATGGTA 

. hLFA-3 

2751 TAGCAGTAAT TACAACATGT ATTGTGCTGT ATATGAATGG TATTCTGAAA 
. ATCGTCATTA ATGTTGTACA TAACACGACA TATACTTACC ATAAGACTTT 
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hLFA-3 J3 pr< 


5 


2801 


TGTGACAGAA AACCAGACAG AACCAACTCC AATTGATTGG CTCGACCPrr 
ACACTGTCTT TTGGTCTGTC TTGGTTGAGG TtSSSS GAgSScC 

13 Pr. 


10 


2851' 


AAT^TACTAT CTAC6TACGA AACCCGCATC CGCTCCCATT CAATTCACAT 
TTACATGATA GATGCATGCT TTGGGCGTAG GCGAGGGTAA GTTmSgW 

13 Pr. 




2901 


' I^r^ GA TAAAATAA AA CCACTGGTGG TTTGCGATTC CGAAATCTGT 

acctgttcct attttatttt ggtgaccacc aaacgctaag gctogIS 

13 Pr. 


1 c 

I J 


2951 ' 


gtggttaaac aaaaacattt ttattctcaa atgagataaa 
tgtagtacgt caccaatttg tttttgtaaa aataagagtt TAcSm 

13 Pr. 


20 


3001 


r^™I* T a ™ tcatta t attacaaagt acaattattt aggtttaatc 

CACTTTTATA TATAGTAATA TAATGTTTCA TGTTAATAAA TCCMAtSg 
IJ Fr * hICAM 


25 


3051 


SSSc CCAGCAGC <* CCGGCCCGCG CTGCCCGCAC 
TTAGGGCGCC cgataccgag GGTCGTCGGG GGCCGGGCGC GACGGGCGTG 

hi CAM 



30 



50 



3101 



3151 



35 .3201 



TCCTGGTCCT GCTCGGGGCT CTGTTCCCAG GACCTGGCAA TGCCCAGACA 

aggaccagga cgagccccga gacaagggtc ctggaccgtt acgggtcJS 

hICAM 

TCTGTGTCCC CCTCAAAAGT CATCCTGCCC^CGGGGAG^ 

agacacaggg ggagttttca gtaggacggg -gSSSS SSSS 

hICAM 

GACATGCAGC ACCTCCTGTG ACCAGCCCAA "gTTGTTGGGC ~ATAGAGArrr 
CTGTACGTCG TGGAGGACAC TGGTCGGGTT CAACAACCCG " 

hICAM 



AO rrla^™ AAAGGAGTTG CTCCTGCCTG GGAACAACCG~GMGGTCTW 

10 GCAACGGATT TTTCCTCAAC GAGGACGGAC CCTTGTTGGC CTTCCACATa' 

hi CAM 

3301 GAACTGAGCA ATGTGCAAGA AGATAGCCAA~CCAATGTGCT ~ATTCAAACTr ' 
^ CTTGACTCGT TACACGTTCT TCTATCGGTT GGTtIcaSa SSJSS ' 



hICAM 



3351 



rr^I^ TGGG CAGTCAA CAG CTAAAACCTT CCTCACCGTG TACTGGAC^C 

gggactaccc gtcagttgtc gattttggaa ggagtggcac atgaSgIg 

hi CAM 

3401 ^~° GGGT GGAACT <SGCA CCCCTCCCCT^CTTGGCAGCC ^AGTGGGCAAG 
GTCTTGCCCA CCTTGACCGT GGGGAGGGGA GAACCGTCGG TCACdCGTTC 

. hICAM 



55 • 34-5 1 



AACCTTACCC TACGCTGCCA GGTGGAGGGT GGGGCACCCC^GGGCCAACCT 
TTGGAATGGG ATGCGACGGT CCACCTCCCA CCCCGTGGGG SStoS 
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hi CAM 



5 


3501 


CACCGTGGTG 
GTGGCACCAC 


CTGCTCCGTG 
GACGAGGCAC 


GGGAGAAGGA 
CCCTCTTCCT 
hICAM 


GCTGAAACGG 
CGACTTTGCC 


GAGCCAGCTG 
CTCGGTCGAC 


10 


3551 


TGGGGGAGCC 
ACCCCCTCGG 


CGCTGAGGTC 
GCGACTCCAG 


ACGACCACGG 
TGCTGGTGCC 
hi CAM 


TGCTGGTGAG 
ACGACCACTC 


GAGAGATCAC 
CTCTCTAGTG 


3601 


CATGGAGCCA ATTTCTCGTG 
GTACCTCGGT TAAAGAGCAC 


CCGCACTGAA 
GGCGTGACTT 
hICAM 


CTGGACCTGC 
GACCTGGACG 


GGCCCCAAGG 
CCGGGGTTCC 


15 


3651 


GCTGGAGCTG 
CGACCTCGAC 


TTTGAGAACA 
AAACTCTTGT 


CCTCGGCCCC 
GGAGCCGGGG 
hICAM' 


CTACCAGCTC 
GATGGTCGAG 


CAGACCTTTG 
GTCTGGAAAC 


.20 


3701 


TCCTGCCAGC GACTCCCCCA 
AGGACGGTCG CTGAGGGGGT 


CAACTTGTCA 
GTTGAACAGT 
hi CAM 


GCCCCCGGGT 
CGGGGGCCCA 


CCTAGAGGTG 
GGATCTCCAC 


25 


3751 


. GACACGCAGG GGACCGTGGT CTGTTCCCTG GACGGGCTGT 
CTGTGCGTCC CCTGGCACCA GACAAGGGAC CTGCCCGACA 
. hi CAM 


TCCCAGTCTC 
AGGGTCAGAG 


30 


3801 


GGAGGCCCAG 
CCTCCGGGTC 


GTCCACCTGG 
CAGGTGGACC 


CACTGGGGGA 
GTGACCCCCT 
hi CAM 


CCAGAGGTTG 
GGTCTCCAAC 


AACCCCACAG 
TTGGGGTGTC 


3851 


• TCACCTATGG 
AGTGGATACC 


CAACGACTCC 
GTTGCTGAGG 


TTCTCGGCCA 
AAGAGCCGGT 
hICAM 


AGGCCTCAGT 
TCCGGAGTCA 


CAGTGTGACC 
GTCACACTGG 


35 


3901 


GCAGAGGACG 
CGTCTCCTGC 


AGGGCACCCA GCGGCTGACG TGTGCAGTAA TACTGGGGAA 
TCCCGTGGGT CGCCGACTGC ACACGTCATT ATGACCCCTT 
hICAM 


40 


3951 


CCAGAGCCAG 
GGTCTCGGTC 


GAGACACTGC 
CTCTGTGACG 


AGACAGTGAC 
TCTGTCACTG 
hICAM 


CATCTACAGC 
GTAGATGTCG 


TTTCCGGCGC 
AAAGGCCGCG 


45 


1001 


CCAACGTGAT TCTGACGAAG 
GGTTGCACTA AGACTGCTTC 


CCAGAGGTCT 
GGTCTCCAGA 
hICAM 


CAGAAGGGAC 
GTCTTCCCTG 


CGAGGTGACA 
GCTCCACTGT 


50 


. 4051 


. GTGAAGTGTG 
CACTTCACAC 


AGGCCCACCC 
TCCGGGTGGG 


TAGAGCCAAG 
ATCTCGGTTC 
hi CAM . 


GTGACGCTGA 
CACTGCGACT 


ATGGGGTTCC 
TACCCCAAGG 


U01 


AGCCCAGCCA' CTGGGCCCGA 
TCGGGTCGGT GACCCGGGCT 


GGGCCCAGCT 
CCCGGGTCGA 
hICAM 


CCTGCTGAAG 
GGACGACTTC 


GCCACCCCAG 
CGGTGGGGTC 


55 


4151 


AGGACAACGG 
TCCTGTTGCC 


GCGCAGCTTC 
CGCGTCGAAG 


TCCTGCTCTG 
AGGACGAGAC 


CAACCCTGGA 
GTTGGGACCT 


GGTGGCCGGC 
CCACCGGCCG 
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hICAM 

4 201 CAGCTTAfAC ACAAGAACCA- GACCCGGGAG CTTCGTGTCC TGTATGGCCC 
GTCGAATATG TGTTCTTGGT CTGG6CCCTC GAAGCACAGG ACATACCGGG " 
5 hICAM 

4251 CCGACTGGAC GAGAGGGATT GTCCGGGAAA CTGGACGTGG CCAGAAAAT^ 
GGCTGACCTG CTCTCCCTAA CAGGCCCTTT GACCTGCACC GGTCTTTTAA 

hICAM 

10 

4301 ■ CCCAGCAGAC TCCAATGTGC CAGGCTTGGG GGAACCCATT GCCCGAGCTC 
GGGTCGTCTG AGGTTACACG GTCCGAACCC CCTTGGGTAA CGGGCTCGAG 

hICAM 



15 4351' 



25 



45 



AAGTGTCTAA AGGATGGCAC TTTCCCACTG CCCATCGGGG AATCAGTGAC 
TTCACAGATT TCCTACCGTG AAAGGGTGAC GGGTAGCCCC TTAGTCACTG 

hICAM 



4401 TGTCACTCGA GATCTTGAGG GCACCTACCT CTGTCGGGCC AGGAGCACTC 
20 ACAGTGAGCT CTAGAACTCC CGTGGATGGA GACAGCCCGG TCCTCGTGAG 

hi CAM 



44 51 AAGGGGAGGT CACCCGCGAG GTGACCGTGA ATGTGCTCTC CCCCCGGTAT 
TTCCCCT.CCA GTGGGCGCTC CACTGGCACT TACACGAGAG GGGGGCCATA" 
• hICAM 



4 501 GAGATTGTCA TCATCACTGT GGTAGCAGCC GCAGTCATAA TGGGCACTGC 
CTpTAACAGT AGTAGTGACA CCATCGTCGG CGTCAGTATT ACCCGTGACG 

hi CAM 

30 * 

4551 AGGCCTCAGC ACGTACCTCT ATAACCGCCA GCGGAAGATC AAGAAATACA 
TCCGGAGTCG TGCATGGAGA TATTGGCGGT CGCCTTCTAG TTCTTTATGT 

hICAM 

35 4 601 GACTACAACA GGCCCAAAAA GGGACCCCCA TGAAACCGAA ' CACACAAGCC 
CTGATGTTGT CCGGGTTTTT CCCTGGGGGT ACTTTGGCTT GTGTGTTCGG 
hi CAM sE/L p r> 



4 651 ACGCCTCCCT GAGCATGCAT GTAGCTTAAA AATTGAAATT TTATTTTTTT 
40 TGCGGAGGGA CTCGTACGTA CATCGAATTT TTAACTTTAA AATAAAAAAa' 

sE/L Pr. • 



4 7 01 TTTTTGGAAT ATAAATAAGC TCGAAGTCGA AATTCCTGCA GCCCGGGGCC ' 
AAAAACCTTA TATTTATTCG AGCTTCAGCT TTAAGGACGT CGGGCCCCGG 

hB7.1 



4751 ATGGGCCACA CACGGAGGCA GGGAACATCA CCATCCAAGT GTCCATACCT 
TACCCGGTGT GTGCCTCCGT CCCTTGTAGT GGTAGGTTCA CAGGTATGGA 

hB7 . 1 

50 r ~ „ 

4 801 ' CAATTTCTTT CAGCTCTTGG TGCTGGCTGG TCTTTCTCAC TTCTGTTCAG 
GTTAAAGAAA GTCGAGAACC ACGACCGACC AGAAAGAGTG AAGACAAGTC 

hB7 . 1 



55 ' . 4851 



GTGTTATCCA CGTGACCAAG GAAGTGAAAG AAGTGGCAAC GCTGTCCTGT 
CACAATAGGT GCACTGGTTC CTTCACTTTC TTCACCGTTG COACAGGACA 
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hB?.l 

4 901 GGTCACAATG TTTCTGTTGA AGAGCTGGCA CAAACTCGCA TCTACTGGCA 
CCAGTGTTAC AAAGACAACT TCTCGACCGT GTTTGAGCGT AGATGACCGT 

hB7.1 

4 951 AAAGGAGAAG AAAATGGTGC TGACTATGAT GTCTGGAGAC ATGAATATAT 
TTTCCTCTTC TTTTACCACG ACTGATACTA CAGACCTCTG TACTTATATA 

hB7 . 1 

• 5001 GGCCCGAGTA CAAGAACCGG ACCATCTTTG ATATCACTAA TAACCTCTCC 
CCGGGCTCAT GTTCTTGGCC TGGTAGAAAG TATAGTGATT ATTGGAGAGG 

hB7 . 1 

15 5051 ATTGTGATCC TGGCTCTGCG CCCATCTGAC GAGGGCACAT ACGAGTGTGT 
TAACACTAGG ACCGAGACGC GGGTAGACTG CTCCCGTGTA TGCTCACACA 

hB7 . 1 



JO 



20 



5101 TGTTCTGAAG TATGAAAAAG ACGCTTTCAA GCGGGAACAC CTGGCTGAAG 
ACAAGACTTC ATACTTTTTC TGCGAAAGTT CGCCCTTGTG ' GACCGACTTC 

hB7.1 



5151 TGACGTTATC AGTCAAAGCT GACTTCCCTA CACCTAGTAT ATCTGACTTT 
ACTGCAATAG TCAGTTTCGA CTGAAGGGAT GTGGATCATA TAGACTGAAA 
25 hB7.1 

5201 GAAATTCCAA CTTCTAATAT TAGAAGGATA ATTTGCTCAA CCTCTGGAGG 
CTTTAAGGTT GAAGATTATA ATCTTCCTAT TAAACGAGTT GGAGACCTCC 

hB7 . 1 

30 . 

5251 TTTTCCAGAG CCTCACCTCT CCTGGTTGGA AAATGGAGAA GAATTAAATG 
AAAAGGTCTC GGAGTGGAGA GGACCAACCT TTTACCTCTT CTTAATTTAC 

hB7.1 

35 • 5301 CCATCAACAC AACAGTTTCC CAAGATCCTG AAACTGAGCT CTATGCTGTT 
GGTAGTTGTG TTGTCAAAGG GTTCTAGGAC TTTGACTCGA GATACGACAA 

hB7.1 

5351 AGCAGCAAAC TGGATTTCAA TATGACAACC AACCACAGCT TCATGTGTCT 
4 0 TCGTCGTTTG ACCTAAAGTT ATACTGTTGG TTGGTGTCGA AGTACACAGA 

hB7.1 

54 01- CATCAAGTAT GGACATTTAA GAGTGAATCA GACCTTCAAC TGGAATACAA 
GTAGTTCATA CCTGTAAATT CTCACTTAGT CTGGAAGTTG ACCTTATGTT 
45 hB7.1 

54 51 CCAAGCAAGA GCATTTTCCT GATAACCTGC TCCCATCCTG GGCCATTACC 
GGTTCGTTCT CGTAAAAGGA CTATTGGACG AGGGTAGGAC CCGGTAATGG 

hB7.1 . 

50 - L _ 

• • 5501 TTAATCTCAG' TAAATGGAAT TTTCGTGATA TGCTGCCTGA CCTACTGCTT * 
AATTAGAGTC ATTTACCTTA AAAGCACTAT ACGACGGACT GGATGACGAA 

hB7.1 

55 5551 ' TGCCCCACGC TGCAGAGAGA GAAGGAGGAA TGAGAGATTG AGAAGGGAAA 
ACGGGGTGCG ACGTCTCTCT CTTCCTCCTT ACTCTCTAAC TCTTCCCTTT 
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hB7.1 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



5601 GTGTACGCCC TGTATAAAAG CTTTCTAGGT TTTTGTTTAG GGCTGCAGGA 
CACATGCGGG ACATATTTTC GAAAGATCCA AAAACAAATC CCGACGTCCT ■ 

5651 ATTCCTCGAG GGATCCCGAT TTTTATGACT AGTTAATCAA ATAAAAAGCA 
TAAGGAGCTC CCTAGGGCTA AAAATACTGA TCAATTAGTT TATTTTTCGT 

57 01 TACAAGCTAT TGCTTCGCTA TCGTTACAAA ATGGCAGGAA TTTTGTGTAA^^ 
ATGTTCGATA ACGAAGCGAT AGCAATGTTT TACCGTCCTT AAAACACATT 

Right Arm 

57 51 ACTAAGCCAC ATACTTGCCA ATGAAAAAAA TAGTAGAAAG GATACTATTT 
TGATTCGGTG TATGAACGGT TACTTTTTTT ATCATCTTTC CTATGATAAA ' 

Right Arm 

5801 TAATGGGATT AGATGTTAAG GTTCCTTGGG ATTATAGTAA CTGGGCATCT 
ATTACCCTAA TCTACAATTC CAAGGAACCC TAATATCATT GACCCGTAGA 

' ■ Right Arm 

5851 GTTAACTTTT ACGACGTTAG GTTAGATACT GATGTTACAG ATTATAATAA 
CAATTGAAAA TGCTGCAATC CAATCTATGA CTACAATGTC TAATATTATT. 

Right Arm 

5901 TGTTACAATA AAATACATGA CAGGATGTGA TATTTTTCCT CATATAACTC 

A'CAATGTTAT TTTATGTACT GTCCTACACT ATAAAAAGGA GTATATTGAG \ ' 

Right Arm 

5951 TTGGAATAGC AAATATGGAT CAATGTGATA GATTTGAAAA TTTCAAAAAG 
AACCTTATCG TTTATACCTA GTTACACTAT CTAAACTTTT AAAGTTTTTC 

Right Arm 

6001 CAAATAACTG ATCAAGATTT ACAGACTATT TCTATAGTCT GTAAAGAAGA 
GTTTATTGAC TAGTTCTAAA TGTCTGATAA AGATATCAGA CATTTCTTCT 

Right Arm 

6051 GATGTGTTTT CCTCAGAGTA ACGCCTCTAA ACAGTTGGGA GCGAAAGGAT 
CTACACAAAA GGAGTCTCAT TGCGGAGATT TGTCAACCCT CGCTTTCCTA 

Right Arm 

6101 GCGCTGTAGT TATGAAACTG GAGGTATCTG ATGAACTTAG AGCCCTAAGa' 
• CGCGACATCA ATACTTTGAC CTCCATAGAC TACTTGAATC TCGGGATTCT 

Right Arm 

6151 AATGTTCTGC TGAATGCGGT ACCCTGTTCG AAGGACGTGT TTGGTGATAT 
TTACAAGACG ACTTACGCCA TGGGACAAGC TTCCTGCACA AACCACTATA 

Right Arm 

6201 CACAGTAGAT AATCCGTGGA ATCCTCACAT AACAGTAGGA TATGTTAAGG 
GTGTCATCTA TTAGGCACCT TAGGAGTGTA TTGTCATCCT ATACAATTCC 

Right Arm 

6251 AGGACGATGT CGAAAACAAG AAACGCCTAA TGGAGTGCAT GTCCAAGTTT 
TCCTGCTACA GCTTTTGTTC TTTGCGGATT ACCTCACGTA CAGGTTCAAA 

Right Arm ■ 
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6301 AGGGGGCAAG AAATACAAGT TCTAGGATGG TATTAATAAG TATCTAAGTA 
TCCCCCGTTC TTTATGTTCA AGATCCTACC ATAATTATTC ATAGATTCAT 



5 


6351 


TTTGGTATAA 
AAACCATATT 


Right Arm 
TTTATTAAAT AGTATAATTA 
AAATAATTTA TCATATTAAT 


TAACAAATAA 
ATTGTTTATT 


TAAATAACAT 
ATTTATTGTA 


10 


6101 


GATAACGGTT 
• CTATTGCCAA 


Right Arm 
TTTATTAGAA XAAAATAGAG 
AAATAATCTT ATTTTATCTC 


ATAATATCAT 
TATTATAGTA 


AATGATATAT 
TTACTATATA 


15 


. 6451 


Right,. Arm 
AATACTTCAT- TACCAGAAAT GAGTAATGGA 
TTATGAAGTA ATGGTCTTTA CTCATTACCT 


AGACTTATAA 
TCTGAATATT 


ATGAACTGCA 
TACTTGACGT 


6501 


TAAAGCTATA 
ATTTCGATAT 


Right Arm 
AGGTATAGAG ATATAAATTT 
TCCATATCTC TATATTTAAA 


AGTAAGGTAT 
TCATTCCATA 


ATACTTAAAA 
TATGAATTTT 


20 


• 6551 


AATGCAAATA 
TTACGTTTAT 


Right Arm 
CAATAACGTA AATATACTAT 
GTTATTGCAT TTATATGATA 


CAACGTCTTT 
GTTGCAGAAA 


GTATTTAGCC 
CATAAATCGG 


25 


6601 


GTAAGTATTT 
CATTCATAAA 


Right Arm 
CTGATATAGA AATGGTAAAA 
GACTATATCT TTACCATTTT 


TTATTACTAG 
AATAATGATC 


AACACGGTGC 
TTGTGCCACG 


30 


6651 


CGATATTTTA 
. GCTATAAAAT 


Right Arm 
AAATGTAAAA ATCCTCCTCT 
TTTACATTTT TAGGAGGAGA 


TCATAAAGCT 
AGTATTTCGA 


GCTAGTTTAG 
CGATCAAATC 


35 


6701 


■ ATAATACAGA 
TATTATGTCT 


Right Arm 
AATTGCTAAA CTACTAATAG 
TTAACGATTT GATGATTATC 


ATTCTGGGGC 
TAAGACCGCG 


TGACATAGAA 
ACTGTATCTT 


6751 


CAGATACATT 
GTCTATGTAA 


Right Arm 
CTGGAAATAG TCCGTTATAT 
GACCTTTATC AGGCAATATA 


ATTTCTGTAT 
TAAAGACATA 


ATAGAAACAA 
TATCTTTGTT 


40 


6801 


TAAGTCATTA 
ATTCAGTAAT 


Right Arm 
ACTAGATATT TATTAAAAAA 
TGATCTATAA ATAATTTTTT 


AGGTGTTAAT 
TCCACAATTA 


TGTAATAGAT 
ACATTATCTA 


45 


• 6851 


Right Arm 
TCTTTCTAAA TTATTACGAT GTACTGTATG 
AGAAAGATTT AATAATGCTA CATGACATAC 


ATAAGATATC TGATGATATG 
TATTCTATAG ACTACTATAC 


50 


6901 


TATAAAATAT 
ATATTTTATA 


Right Arm * 
TTATAGATTT TAATATTGAT 
AATATCTAAA ATTATAACTA 


CTTAATATAC 
GAATTATATG 


AAACTAGAAA 
TTTGATCTTT 


55 


6951 


Right Arm 
TTTTGAAACT CCGTTACATT ACGCTATAAA 
AAAACTTTGA GGCAATGTAA TGCGATATTT 


GTATAAGAAT ATAGATTTAA 
CATATT.CTTA TATCTAAATT 



Right Arm \ 
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7001 TTAGGATATT GTTAGATAAT AGTATTAAAA TAGATAAAAG TTTATTTTTG 
AATCCTATAA CAATCTATTA TCATAATTTT ATCTATTTTC AAATAAAAAC 

Right Arm 

7051 CATAAACAGT ATCTCATAAA GGCACTTAAA AATAATTGTA GTTACGATAT 
GTATTTGTCA TAGAGTATTT CCGTGAATTT TTATTAACAT CAATGCTATA 

Right Arm -v ,,' 

7101 AATAGCGTTA CTTATAAATC ACGGAGTGCC TATAAACGAA CAAGATGATT 
TTATCGCAAT GAATATTTAG TGCCTCACGG ATATTTGCTT GTTCTACTAA 

Right Arm 

7151 TAGGTAAAAC- CCCATTACAT CATTCGGTAA TTAATAGAAG AAAAGATGTA 
ATCCATTTTG GGGTAATGTA GTAAGCCATT AATTATCTTC TTTTCTACAT 

. Right Arm 

7201 ACAGCACTTC TGTTAAATCT AGGAGCTGAT ATAAACGTAA TAGATGACTG 
TGTCGTGAAG ACAATTTAGA TCCTCGACTA TATTTGCATT. ATCTACTGAC 

Right Arm 

' 7251 S^ CAGT CCCTTACATT ACGCTGTTTC ACGTAACGAT ATCGAAACAA 
ATACCCGTCA GGGAATGTAA TGCGACAAAG TGCATTGCTA TAGCTTTGTT 

Right .Arm 

7301- CAAAGACACT TTTAGAAAGA GGATCTAATG TTAATGTGGT TAATAATCAT 
GTTTCTGTGA AAATCTTTCT CCTAGATTAC AATTACACCA ATTATTAGTA 

Right Arm 

7351 ATAGATACCG TTCTAAATAT AGCTGTTGCA TCTAAAAACA AAACTATAGT 
TATCTATGGC AAGATTTATA TCGACAACGT AGATTTTTGT TTTGATATCA 

Right Arm 

74 01 AAACTTATTA CTGAAGTACG GTACTGATAC AAAGTTGGTA. GGATTAGATA 
TTTGAATAAT GACTTCATGC CATGACTATG TTTCAACCAT CCTAATCTAT 

Right Arm 

74 51 AACATGTTAT TCACATAGCT ATAGAAATGA AAGATATTAA TATACTGAAT 
TTGTACAATA AGTGTATCGA TATCTTTACT TTCTATAATT ATATGACTTA 

Right Arm 

7501 GCGATCTTAT TATATGGTTG CTATGTAAAC GTCTATAATC ATAAAGGTTT 
CGCTAGAATA ATATACCAAC GATACATTTG CAGATATTAG TATTTCCAAA 

„ e ,. Right Arm 

7551 . CACTCCTCTA TACATGGCAG TTAGTTCTAT GAAAACAGAA TTTGTTAAAC 
GTGAGGAGAT ATGTACCGTC AATCAAGATA CTTTTGTCTT AAACAATTTG 

Right Arm ' 

7601 TCTTACTTGA CCACGGTGCT TACGTAAATG CTAAAGCTAA GTTATCTGGA 
AGAATGAACT GGTGCCACGA ATGCATTTAC GATTTCGATT CAATAGACCT 

Right Arm 

7651 AATACTCCTT TACATAAAGC TATGTTATCT AATAGTTTTA ATAATATAAA 
TTATGAGGAA ATGTATTTCG ATACAATAGA TTATCAAAAT TATTATATTT 
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Right Arm 

7701. ATTACTTTTA TCTTATAACG CCGACTATAA TTCTCTAAAT AATCACGGTA 
TAATGAAAAT AGAATATTGC GGCTGATATT AAGAGATTTA TTAGTGCCAT 



7751 


ATACGCCTCT 
TATGCGGAGA 


Right Arm 

AACTTGTGTT AGCTTTTTAG ATGACAAGAT AGCTATTATG 
TTGAACACAA TCGAAAAATC TACTGTTCTA TCGATAATAC 


7001 


■ ATAATATCTA 
TATTATAGAT 


Right Arm 
AAATGATGTT AGAAATATCT 
TTTACTACAA TCTTTATAGA 


AAAAATCCTG 
TTTTTAGGAC 


AAATAGCTAA 
TTTATCGATT 


7851 


TTCAGAAGGT 
AAGTCTTCCA 


Right Arm 
TTTATAGTAA ACATGGAACA 
AAATATCATT TGTACCTTGT 


TATAAACAGT 
ATATTTGTCA 


AATAAAAGAC 
TTATTTTCTG 


7901 


TACTATCTAT 
ATGATAGATA 


Right. Arm 
AAAAGAATCA TGCGAAAAAG 
TTTTCTTAGT ACGCTTTTTC 


AACTAGATGT 
TTGATCTACA 


TATAACACAT 
ATA'JTGTGTA 


,7951 


ATAAAGTTAA 
TATTTCAATT 


Right Arm 
ATTCTATATA TTCTTTTAAT 
TAAGATATAT AAGAAAATTA 


ATCTTTCTTG ACAATAACAT 
TAGAAAGAAC TGTTATTGTA 


8001 


r 

AGATCTTATG 
TCTAGAATAC 


Right Arm 
GTAAAGTTCG TAACTAATCC TAGAGTTAAT 
CATTTCAAGC ATTGATTAGG ATCTCAATTA 


AAGATACCTG 
TTCTATGGAC 


8051 


CATGTATACG 
GTACATATGC 


Right Arm 
TATATATAGG GAATTAATAC 
ATATATATCC CTTAATTATG 


CJGAAAAATAA 
CCTTTTTATT 


ATCATTAGCT 
TAGTAATCGA 


0101 • 


TTTCATAGAC 
AAAGTATCTG 


Right Arm 
ATCAGCTAAT AGTTAAAGCT 
TAGTCGATTA TCAATTTCGA 


GTAAAAGAGA 
CATTTTCTCT 


GTAAGAATCT 
CATTCTTAGA 


8151 


AGGAATAATA 
TCCTTATTAT 


Right Arm 
GGTAGGTTAC CTATAGATAT 
CCATCCAATG GATATCTATA 


CAAACATATA 
GTTTGTATAT 


ATAATGGAAC 
TATTACCTTG 


8201 


TATTAAGTAA 
ATAATTCATT 


Right Arm 
TAATGATTTA CATTCTGTTA 
ATTACTAAAT GTAAGACAAT 


TCACCAGCTG 
AGTGGTCGAC 


TTGTAACCCA 
AACATTGGGT 


0251 


GTAGTATAAA 
CATCATATTT 


Right Arm 
GAGCTCCAGC TTTTGTTCCC 
CTCGAGGTCG AAAACAAGGG 


TTTAGTGAGG 
AAATCACTCC 


GTTAATTCCG 
CAATTAAGGC 



Right Arm 

8301 AGCTTGGCGT AATCATGGTC • ATAGCTGTTT CCTGTGTGAA ATTGTTATCC 
TCGAACCGCA TTAGTACCAG TATCGACAAA GGACACACTT TAACAATAGG 

8351 GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG TGTAAAGCCT 
CGAGTGTTAA GGTGTGTTGT . ATGCTCGGCC TTCGTATTTC ACATTTCGGA 

84 01 GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG 
CCCCACGGAT TACTCACTCG ATTGAGTGTA ATTAACGCAA CGCGAGTGAC 

84 51 CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG 
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10 



• 15 



20 



25 



30 



35 



40 



45 



50 



55 



8501 
8551 
8601 
8651 
8701 
8751 
0801 
8051 
8901 
B951 
9001 
9051 
9101 
9151 
9201 
9251 
9301 
9351 ■ 
9401 
9451 
9501 
9551 
9601 
9651 
9701 
9751 
9801 



GGGCGAAAGG 
CCAACGCGCG 
GGTTGCGCGC 
CGCTCACTGA 
GCGAGTGACT 
GCTCACTCAA 
CGAGTGAGTT 
AGGAAAGAAC 
TCCTTTCTTG 
• AGGCCGCGTT 
TCCGGCGCAA 
CACAAAAATC 
GTGTTTTTAG 
AAGATACCAG 
TTCTATGGTC 
CGACCCTGCC 
GCTGGGACGG 
' GTGGCGCTTT 
CACCGCGAAA 
CGTTCGCTCC 
GCAAGCGAGG 
GCTGCGCCTT 
CGACGCGGAA 
' GACTTATCGC 
CTGAATAGCG 
GTATGTAGGC 
CATACATCCG 
ACACTAGAAG 
TGTGATCTTC 
TTCGGAAAAA 
AAGCCTTTTT 
TAGCGGTGGT 
ATCGCCACCA 
GATCTCAAGA 
CTAGAGTTCT 
AACGAAAACT 
TTGCTTTTGA 
CTTCACCTAG 
GAAGTGGATC 
GTATATATGA 
CATATATACT 
GCACCTATCT 
CGTGGATAGA 
CCCCGTCGTG 
GGGGCAGCAC 
GTGCTGCAAT 
CACGACGTTA 
GCAATAAACC 
CGTTATTTGG 
TTTATCCGCC 
AAATAGGCGG' 
GTAGTTCGCC 
CATCAAGCGG 
ATCGTGGTGT 
TAGCACCACA 



TCAGCCCTTT 
GGGAGAGGCG 
CCCTCTCCGC 
CTCGCTGCGC 
GAGCGACGCG 
AGGCGGTAAT 
TCCGCCATTA 
ATGTGAGCAA 
TACACTCGTT 
GCTGGCGTTT 
CGACCGCAAA 
GACGCTCAAG 
CTGCGAGTTC 
GCGTTTCCCC 
CGCAAAGGGG 
GCTTACCGGA 
CGAATGGCCT 
CTCATAGCTC 
GAGTATCGAG 
AAGCTGGGCT 
TTCGACCCGA 
ATCCGGTAAC 
TAGGCCATTG 
CACTGGCAGC 
GTGACCGTCG 
GGTGCTACAG 
CCACGATGTC 
GACAGTATTT 
CTGTCATAAA 
GAGTTGGTAG 
CTCAACCATC 
TTTTTTGTTT 
AAAAAACAAA 
AGATCCTTTG 
TCTAGGAAAC 
CACGTTAAGG 
GTGCAATTCC 
ATCCTTTTAA 
TAGGAAAA'J'T 
GTAAACTTGG 
CATTTGAACC 
CAGCGATCTG 
GTCGCTAGAC 
TAGATAACTA 
ATCTATTGAT 
GATACCGCGA 
CTATGGCGCT 
AGCCAGCCGG 
TCGGTCGGCC 
TCCATCCAGT 
AGGTAGGTCA 
AGTTAATAGT 
TCAATTATCA 
CACGCTCGTC 
GTGCGAGCAG 



GGACAGCACG 
GTTTGCGTAT 
CAAACGCATA 
TCGGTCGTTC 
AGCCAGCAAG 
ACGGTTATCC 
TGCCAATAGG 
AAGGCCAGCA 
TTCCGGTCGT 
TTCCATAGGC 
AAGGTATCCG 
TCAGAGGTGG 
AGTCTCCACC 
CTGGAAGCTC 
GACCTTCGAG 
TACCTGTCCG 
ATGGACAGGC 
ACGCTGTAGG 
TGCGACATCC 
GTGTGCACGA 
CACACGTGCT 
TATCGTCTTG 
ATAGCAGAAC 
AGCCACTGGT 
TCGGTGACCA 
AGTTCTTGAA 
TCAAGAACTT 
GGTATCTGCG 
CCATAGACGC 
CTCTTGATCC 
GAGAACTAGG 
GCAAGCAGCA 
CGTTCGTCGT 
ATCTTTTCTA 
TAGAAAAGAT 
GATTTTGGTC 
CTAAAACCAG 
ATTAAAAATG 
TAATTTTTAC 
TCTGACAGTT 
AGACTGTCAA* 
TCTATTTCGT 
AGATAAAGCA 
CGATACGGGA 
GCTATGCCCT 
GACCCACGCT 
CTGGGTGCGA 
AAGGGCCGAG 
TTCCCGGCTC 
CTATTAATTG 
GATAATTAAC 
TTGCGCAACG 
AACGCGTTGC 
GTTTGGTATG 
CAAACCATAC 



GTCGACGTAA 
TGGGCGCTCT 
ACCQGCGAGA 
GGCTGCGGCG 
CCGACGCCGC 
ACAGAATCAG 
TGTCTTAGTC 
AAAGGCCAGG 
TTTCCGGTCC 
TCCGCCCCCC 
AGGCGGGGGG 
CGAAACCCGA 
GCTTTGGGCT 
CCTCGTGCGC 
GGAGCACGCG 
CCTTTCTCCC 
GGAAAGAGGG 
TATCTCAGTT 
ATAGAGTCAA 
ACCCCCCGTT 
TGGGGGGCAA 
AGTCCAACCC 
TCAGGTTGGG 
AACAGGATTA 
TTGTCCTAAT 
GTGGTGGCCT 
CACCACCGGA. 
CTCTGCTGAA 
GAGACGACTT 
GGCAAACAAA 
CCGTTTGTTT 
GATTACGCGC 
CTAATGCGCG 
CGGGGTCTGA 
GCCCCAGACT 
ATGAGATTAT 
TACTCTAATA 
AAGTTTTAAA 
TTCAAAATTT 
ACCAATGCTT 
TGGTTACGAA 
TCATCCATAG 
AGTAGGTATC 
GGGCTTACCA 
CCCGAATGGT 
CACCGGCTCC 
GTGGCCGAGG 
CGCAGAAGTG 
GCGTCTTCAC 

ttgccgggaa' 
aacggccctt 

TTGTTGCCAT 
AACAACGGTA 
GCTTCATTCA 
CGAAGTAAGT 



TTACTTAGCC 
TCCGCTTCCT 
AGGCGAAGGA 
AGCGGTATCA 
TCGCCATAGT 
GGGATAACGC 
CCCTATTGCG 
AACCGTAAAA 
TTGGCATTTT 
TGACGAGCAT 
ACTGCTCGTA 
CAGGACTATA 
GTCCTGATAT 
TCTCCTGTTC 
AGAGGACAAG 
TTCGGGAAGC 
AAGCCCTTCG 
CGGTGTAGGT 
GCCACATCCA 
CAGCCCGACC 
GTCGGGCTGG 
GGTAAGACAC 
CCATTCTGTG 
GCAGAGCGAG 
CGTCTCGCTC 
AACTACGGCT 
TTGATGCCGA 
GCCAGTTACC 
'CGGTCAATGG 
CCACCGCTGG 
GGTGGCGACC 
AGAAAAAAAG 
TCTTTTTTTC 
CGCTCAGTGG 
GCGAGTCACC * 
CAAAAAGGAT 
GTTTTTCCTA 
TCAATCTAAA 
AGTTAGATTT 
AATCAGTGAG 
TTAGTCACTC 
TTGCCTGACT 
AACGGACTGA 
TCTGGCCCCA 
AGACCGGGGT 
AGATTTATCA 
TCTAAATAGT 
GTCCTGCAAC 
CAGGACGTTG 
GCTAGAGTAA 
CGATCTCATT " 
TGCTACAGGC 
ACGATGTCCG 
GCTCCGGTTC 
CGAGGCCAAG 
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9851 CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG 
GGTTGCTAGT TCCGCTCAAT GTACTAGGGG GTACAACACG TtTTTTCGCC 
9901 TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA. GAAGTAAGTT GGCCGCAGTG 
AATCGAGGAA GCCAGGAGGC TAGCAACAGT CTTCATTCAA CCGGCGTCAC 
5 9951 TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC 
AATAGTGAGT ACCAATACCG TCGTGACGTA TTAAGAGAAT GacAGTACGG 
10001 ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT 
TAGGCATTCT ACGAAAAGAC ACTGACCACT CATGAGTTGG TTCAGTAAGA 
10051 GAGAATAGTG TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG 
10 ■ CTCTTATCAC ATACGCCGCT GGCTCAACGA GAACGGGCCG CAGTTATGCC 

10101 GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA 
CTATTATGGC GCGGTGTATC GTCTTGAAAT TTTCACGAGT AGTAACCTTT 
10151 • ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA 
TGCAAGAAGC CCCGCTTTTG AGAGTTCCTA GAATGGCGAC AACTCTAGGT 
IS 10201 GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT 
CAAGCTACAT TGGGTGAGCA CGTGGGTTGA CTAGAAGTCG TAGAAAATGA 
10251 TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA 
AAGTGGTCGC AAAGACCCAC TCGTTTTTGT CCTTCCGTTT TACGGCGTTT 
10301 AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA. CTCTTCCTTT 
20 TTTCCCTTAT TCCCGCTGTG CCTTTACAAC TTATGAGTAT GAGAAGGAAA 

10351 TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC 
AAGTTATAAT AACTTCGTAA ATAGTCCCAA TAACAGAGTA CTCGCCTATG 
* 104 01 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT 
TATAAACTTA CATAAATCTT TTTATTTGTT .TATCCCCAAG GCGCGTGTAA 
25 10451 TCCCCGAAAA GTGCCACCTGAGGGGCTnT CACGGTGGAC 
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FIGURE 3 : Donor plnsmid pi 132 

C5 Right Arm 

' 1 . .TGAATGTTAA ATGTTATACT TTGGATGAAG CTATAAATAT ~GCATTGGAAA 
ACTTACAATT TACAATATGA AACCTACTTC GATATTTATA CGTAACCTTT 
• . C5 Right Arm 

51 AATAATCCAT TTAAAGAAAG GATTCAAATA CTACAAAAdc~TMGCGATAA 
TTATTAGGTA AATTTCTTTC CTAAGTTTAT GATGTTTTGG ATTCGCTATT 
C5 Right Arm 

101 TATGTTAACr AAGCTTATTC TTAACGACGC TTTAAATATA ^CACAAATAAA 
ATACAATTGA TTCGAATAAG AATTGCTGCG AAATTTATAT SSJJSJt 
,J • C5 Right Arm 

151 CATAATTTTT GTATAACCTA ACAAATAACT~AAAACATAAS "aATAATAAAA 
GTATTAAAAA CATATTGGAT TGTTTATTGA TTTTGTATTT TTATTATTTT 
2Q C5 Right -Arm 

• 201 GGAAATGTAA TATCGTAATT ATTTTACTCA GGA^TGGGGT^TAAATATTTA 
CCTTTACATT ATAGCATTAA TAAAATGAGT CCTTACCCCA ATTTATAAAT 
C5 Right Arm 

25 251 TATCACGTGT ATATCTATAC TGTTATCGTA TACTCTOTAC^MTTACTATT 

ATAGTGCACA TATAGATATG ACAATAGCAT ATGAGAAATG TTMTGATaI 
C5 Right Arm 

301 ACGAATATGC AAGAGATAAT AAGATTACGT ATTTAAGAGa'aTCTTGTCAT 
TGCTTATACG TTCTCTATTA TTCTAATGCA TAAATTCTCT TAGAACAGTA 
C5 Right Arm 

351 GATAATTGGG TACGACATAG TGATAAATGC TATTTCGCAT~CGTTACATAA 
CTATTAACCC ATGCTGTATC ACTATTTACG ATAAAGCGTA GCAAfGTAT? ' 
C5 Right Arm 

401 AGTCAGTTGG AAAGATGGAT TTGACAGATG TAA^ T TAATA ^GGTGC AAAAA 
TCAGTCAACC TTTCTACCTA AACTGTCTAC ATTGAATTAT CCACGTTTTT 
C5 Right Arm 

451 TGTTAAATAA CAGCATTCTA TCGGAAGATA GGATACCAGT~TATATTATAC - 
ACAATTTATT GTCGTAAGAT AGCCTTCTAT CCTATGGTCA AtSSg 
C5 Right Arm 

45 ™^ ATCACT GGTTGG ™ MCAGATTCTlc^ATOCrT^GATGA 

TTTTTAGTGA CCAACCTATT TTGTCTAAGA CGTTATAAGC ATTTTCTACT 
C5 Right Arm 

551 AGATTACTGC GAATTTGTAA ACTATGACAA TA^GCCrTTTATCTCAA 
50 TCTAATGACG CTTAAACATT TGATACTGTT ATTTTTCGGT AAATAGAGTT 

C5 Right Arm 

. • 601 CGACATCGTG TAATTCTTCC ATGTTTTATG TATGTGTTTC^AGATATTATG 

^ GCTGTAGCAC ATTAAGAAGG TACAAAATAC ATACACAAAG TCTATAATAC 



30 



35 



40 
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651 AGATTACTAT AAACTTTTTG TATACTTATA TTCCGTAAAC TATATTAATC 
TCTAATGA.TA TTTGAAAAAC ATATGAATAT AAGGCATTTG ATATAATTAG 
C5 Right Arm 

.701 ATGAAGAAAA TGAAAAAGTA TAGAAGCTGT TCACGAGCGG TTGTTGAAAA 
TACTTCTTTT ACTTTTTCAT ATCTTCGACA AGTGCTCGCC AACAACTTTT 
C5 Right Arm 



751 CAACAAAATT ATACATTCAA GATGGCTTAC ATATACGTCT GTGAGGCTAT 
GTTGTTTTAA TATGTAAGTT CTACCGAATG TATATGCAGA CACTCCGATA ' 
C5 Right Arm 

15 ' 

801 CATGGATAAT GACAATGCAT CTCTAAATAG GTTTTTGGAC AATGGATTCG . 
GTACCTATTA CTGTTACGTA GAGATTTATC CAAAAACCTG TTACCTAAGC 
C5 Right Arm 

20 851 ACCCTAACAC GGAATATGGT ACTCTACAAT CTCCTCTTGA AATGGCTGTA 

TGGGATTGTG CCTTATACCA TGAGATGTTA GAGGAGAACT TTACCGACAT- 
C5 Right Arm 

• 901 ATGTTCAAGA ATACCGAGGC TATAAAAATC TTGATGAGGT ATGGAGCTAA 
25 TACAAGTTCT TATGGCTCCG ■ ATATTTTTAG AACTACTCCA TACCTCGATT 

C5 Right Arm 

951 ACCTGTAGTT ACTGAATGCA CAACTTCTTG TCTGCATGAT GCGGTGTTGA 
. TGGACATCAA ■ TGACTTACGT GTTGAAGAAC AGACGTACTA CGCCACAACT 
30 C5 Right Arm 

1001 GAGACGACTA CAAAATAGTG AAAGATCTGT TGAAGAATAA CTATGTAAAC 
CTCTGCTGAT GTTTTATCAC TTTCTAGACA ACTTCTTATT GATACATTTG 
C5 Right Arm 

35 ' ~ ^ 

1051 AATGTTCTTT ACAGCGGAGG CTTTACTCCT TTGTGTTTGG CAGCTTACCT 
TTACAAGAAA TGTCGCCTCC GAAATGAGGA AACACAAACC GTCGAATGGA 
C5 Right Arm 

40 1101 TAACAAAGTT AATTTGGTTA AACTTCTATT GGCTCATTCG GCGGATGTAG 
• ATTGTTTCAA TTAAACCAAT TTGAAGATAA CCGAGTAAGC CGCCTACATC 
C5 Right Arm 

1151 ATATTTCAAA CACGGATCGG TTAACTCCTC TACATATAGC CGTATCAAAT ' 
45 TATAAAGTTT GTGCCTAGCC AATTGAGGAG ATGTATATCG GCATAGTTTA 

C5 Right Arm. 

1201 AAAAATTTAA CAATGGTTAA ACTTCTATTG AACAAAGGTG CTGATACTGA 
TTTTTAAATT GTTACCAATT TGAAGATAAC TTGTTTCCAC GACTATGACT 
50 . C5. Right Arm 

1251 ' CTTGCTGGAT AACATGGGAT GTACTCCTTT AATGATCGCT GTACAATCTG 
GAACGACCTA TTGTACCCTA. CATGAGGAAA TTACTAGCGA CATGTTAGAC 

55 ■ ' 
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C5 Right Arm 

1301 GAAATATTGA AATATGTAGC ACACTACTTA AAAAAMTAA~AATGTCCAGA 
CTTTATAACT TTATACATCG TGTGATGAAT TTTTTTTATT mCAGCTC? • 
C5 Right Arm 

1351 ACTGGGAAAA ATTGATCTTG CCAGCTGTAA TTCATGGTAG "aAAAGAAGT? 

TGACCCTTTT TAACTAGAAC GGTCGACATT AAGTACCATC toS^S 
C5 Right Arm 

1401 . "S^TAC TTTTCAACAA AGGAGCAGA7G^CTACr^CTTOGA^G ■ 
GAGTCCGATG AAAAGTTGTT TCCTCGTCTA CATTTGATGT AGAAACTTTC 
' C5 Right Arm 

1451 mSSJJ 

TTTACCTTTT AG1ATATGAC AAAACCTTAA CTAATTTCTT TCAATGAGAC 

C5 Right Arm 

1501 GAGGTAGCTG AAGTGGTACT ~CTCAAAGGTA CGTGACTAAT 

TCTGTGTTTT CTCCATCGAC TTCACCATGA GAGTTTCCAT GCACTgI™ 

Repeat Region 

1551 1*™^ AAGGATCGG C CGCTCTAGAA CTAGTGGATC GGGTTCTTTA 
ATCGATATTT TTCCTAGCCG GCGAGATCTT GATCACCTAG CCCAAaSS 
Repeat Region 

1601 V™*™** MTAMTACA~^GGTTCTTG~AGGGTTGTGT 

AAGATATGAA TTTTTCAC.TT TTATTTATGT TTCCAAGAAC TCCCmSS 
Repeat Region 

1651 —,1?^ GCGAGAAA ™ ATCATA^TT~ATTTCATTAT~CGC^TATCC ' 
ATTTAACTTT CGCTCTTTAT TAGTATTTAA TAAAGTAATA GCGcKS 
Repeat Region 

1701 GTTAAGTTTG TATCGTACCC CGATCCCCCg"aGCCATGCAG~GCCG^GGCC 
CAATTCAAAC ATAGCATGGG GCTAGGGGGC TCGGTAcSc SSJS 
Repeat Region 

1751 GGGGCACAGG GGGTTCGACG GGCGATGCTG~ATGGCCCAGG~AGrrrrTr^ • 
CCCCGTGTCC CCCAAGCTGC CCGCTACGAC tSSSS JSgScCG ' 
Repeat Region ' 

1801 ATTCCTGATG GCCCAGGGGG CMTGCTGGC^GGCCCAGGAG^AGRrr^^rn ' 
TAAGGACTAC CGGGTCCCCC GTTACGACCG cSS JSSSg ' 
Repeat Region 

1051 ^ GGCGGC AG AGGTCCCC GGGGCGCTGG~GGCAGCMGg"gCCTCGTCGC 
GTGCCCGCCG TCTCCAGGGG CCCCGCGTCC CCGTCGTTCC CGGaSSg 
Repeat Region 

1901 gc?c??cJcc slii^ 

GCCCTCCTCC GCGGGGCGCC CCAGGCGTAC CGCCGCGCCG AAGTCCCGAC 



1951 



Repeat Region 

AATGGATGCT GCAGATGCGG GGCCAGGGGrcc^GAGCrGCCTGCTTCA ' 
TTACCTACGA CGTCTACGCC CGGGTCCCCC GGCCTCTCGG CGGACGAACT 
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Repeat Region 



2001 GTTCTACCTC GCCATGCCTT TCGCGACACC CATAGCTTGA TATCGAATTC 
CAAGATGGAG CGGTACGGAA AGCGCTGTGG GTATCGAACT ATAGCTTAAG 

C1B promoter 



2051 


TAGGGGGATC 
ATCCCCCTAG 


CACTAGTTCT AGAGGATCAT 
GTGATCAAGA TCTCCTAGTA 
C1B promoter 


TATTTAACGT 
ATAAATTGCA 


AAACTAAATG 
TTTGATTTAC 


2101 


GAAAAGCTAT 
CTTTTCGATA 


TTACAGGTAC ATACGGTGTT 
AATGTCCATG TATGCCACAA 
C1B promoter 


TTTCTGGAAT 
AAAGACCTTA 


CAAATGATTC 
GTTTACTAAG 


2151 


TGATTTTGAG 
ACTAAAACTC 


GATTTTATCA ATACAATAAT 
CTAAAATAGT TATGTTATTA 
C1B promoter 


GACAGTGCTA 
CTGTCACCAT 


ACTGGTAAAA 
TGACCATTTT 


2201 


AAGAAAGCAA 
TTCTTTCGTT 


ACAATTATCA TGGCTAACAA 
TGTTAATAGT ACCGATTGTT 
C1B promoter 


TTT.TTATTAT 
AAAAATAATA 


ATTTGTAGTA 
TAAACATCAT 



2251 TGCATAGTGG TCTTTACGTT TCTTTATTTA AAGTTAATGT GTTAAGATTA 
ACGTATCACC AGAAATGCAA AGAAATAAAT TTCAATTACA CAATTCTAAT 
C1B promoter LacZ 



2301 


AATGGAGTAA 
TTACCTCATT 


TTGGATCCCC 
AACCTAGGGG 


CATCGATGGG GAATTCACTG GCCGTCGTTT 
GTAGCTACCC CTTAAGTGAC CGGCAGCAAA 
LacZ 


2351 


TACAACGTCG 
ATGTTGCAGC 


TGACTGGGAA 
ACTGACCCTT 


AACCCTGGCG TTACCCAACT TAATCGCCTT 
TTGGGACCGC AATGGGTTGA ATTAGCGGAA 
LacZ 


2401 


GCAGCACATC 
CGTCGTGTAG 


CCCCTTTCGC 
GGGGAAAGCG 


CAGCTGGCGT AATAGCGAAG AGGCCCGCAC 
GTCGACCGCA TTATCGCTTC TCC-GGGCGTG 
LacZ 


2451 


CGATCGCCCT 
GCTAGCGGGA 


TCCCAACAGT 
AGGGTTGTCA 


TGCGCAGCCT GAATGGCGAA TGGCGCTTTG 
ACGCGTCGGA CTTACCGCTT ACCGCGAAAC 
LacZ 


2501 


CCTGGTTTCC 
GGACCAAAGG 


GGCACCAGAA 
CCGTGGTCTT 


GCGGTGCCGG AAAGCTGGCT GGAGTGCGAT 
CGCCACGGCC TTTCGACCGA CCTCACGCTA 
LacZ 


2551 


CTTCCTGAGG 
GAAGGACTCC 


CCGATACTGT 
GGCTATGACA 


CGTCGTCCCC TCAAACTGGC AGATGCACGG 
GCAGCAGGGG AGTTTGACCG TCTACGTGCC 
LacZ 


2601 


TTACGATGCG 
AATGCTACGC 


CCCATCTACA CCAACGTGAC CTATCCCATT ACGGTCAATG 
GGGTAGATGT GGTTGCACTG GATAGGGTAA TGCCAGTTAG 
LacZ 



2651 CGCCGTTTGT TCCCACGGAG AATCCGACGG GTTGTTACTC GCTCACATTT 



GCGGCAAACA AGGGTGCCTC TTAGGCTGCC CAACAATGAG CGAGTGTAAA 



19/53 



WO 2005/026370 



PCT/US20IM/02875J 



LacZ 



2701 AATGTTGATG AAAGCTGGCT ACAGGAAGGC CAGACGCGAA TTATTTTTGA 
TTACAACTAC TTTCGACCGA TGTCCTTCCG GTCTGCGCTT AATAAAAACT 

LacZ 

27 51 TGGCGTTAAC TCGGCGTTTC ATCTGTGGTG CAACGGGCGC TGGGTCGGTT 
ACCGCAATTG AGCCGCAAAG TAGACACCAC GTTGCCCGCG ACCCAGCCAA 

LacZ 



2801 . ACGGCCAGGA CAGTCGTTTG CCGTCTGAAT TTGACCTGAG CGCATTTTTA 
TGCCGGTCCT GTCAGCAAAC GGCAGACTTA AACTGGACTC GCGTAAAAAT 

LacZ 



2851 CGCGCCGGAG AAAACCGCCT CGCGGTGATG GTGCTGCGCT GGAGTGACGG 

GCGCGGCCTC TTTTGGCGGA GCGCCACTAC CACGACGCGA CCTCACTGCC . 

LacZ 



2901 CAGTTATCTG GAAGATCAGG ATATGTGGCG GATGAGCGGC ATTfTCCGTG 
GTCAATAGAC CTTCTAGTCC TATACACCGC CTACTCGCCG TAAAAGGCAC 

LacZ 



2951 ACGTCTCGTT GCTGCATAAA CCGACTACAC AAATCAGCGA TTTCCATGTT 
TGCAGAGCAA CGACGTATTT GGCTGATGTG TTTAGTCGCT AAAGGTACAA 

LacZ 

3001 GCCACTCGCT TTAATGATGA TTTCAGCCGC GCTGTACTGG AGGCTGAAGT 
CGGTGAGCGA AATTACTACT AAAGTCGGCG CGACATGACC TCCGACTTCA 

LacZ 



3051 TCAGATGTGC GGCGAGTTGC GTGACTACCT ACGGGTAACA GTTTCTTTAT 
AGTCTACACG CCGCTCAACG CACTGATGGA TGCCCATTGT CAAAGAAATA 

LacZ 



3101 GGCAGGGTGA AACGCAGGTC GCCAGCGGCA CCGCGCCTTT CGGCGGTGAA 
CCGTCCCACT TTGCGTCCAG CGGTCGCCGT GGCGCGGAAA GCCGCCACTT 

LacZ 



3151 ATTATCGATG AGCGTGGTGG TTATGCCGAT CGCGTCACAC TACGTCTGAA" 
TAATAGCTAC TCGCACCACC AATACGGCTA GCGCAGTGTG ATGCAGACTT 

LacZ 



3201 CGTCGAAAAC CCGAAACTGT GGAGCGCCGA AATCCCGAAT CTCTATCGTG 
GCAGCTTTTG GGCTTTGACA CCTCGCGGCT TTAGGGCTTA GAGATAGCAC 

LacZ 

3251 CGGTGGTTGA ACTGCACACC GCCGACGGCA CGCTGATTGA AGCAGAAGCC 
GCCACCAACT TGACGTGTGG CGGCTGCCGT GCGACTAACT TCGTCTTCGG 

LacZ 

3301 TGCGATGTCG GTTTCCGCGA GGTGCGGATT GAAAATGGTC 7GCTGCTGCT 
ACGCTACAGC CAAAGGCGCT CCACGCCTAA CTTTTACCAG ACGACGACGA 

LacZ 



3351 GAACGGCAAG CCGTTGCTGA TTCGAGGCGT TAACCGTCAC GAGCATCATC 
CTTGCCGTTC GGCAACGACT AAGCTCCGCA ATTGGCAGTG CTCGTAGTAG 



20/53 



WO 2005/026370 



PCT/US2004/028751 



LacZ 



5 • 


3401 


CTCTGCATGG TCAGGTCATG 
GAGACGTACC AGTCCAGTAC 


GATGAGCAGA CGATGGTGCA GGATATCCTG 
CTACTCGTCT GCTACCACGT CCTATAGGAC 
LacZ 


.10 


3451 


CTGATGAAGC AGAACAACTT TAACGCCGTG CGCTGTTCGC ATTATCCGAA 
GACTACTTCG TCTTGTTGAA ATTGCGGCAC GCGACAAGCG TAATAGGCTT 

LacZ 


• 3501 


CCATCCGCTG 
GGTAGGCGAC 


TGGTACACGC 
ACCATGTGCG 


TGTGCGACCG CTACGGCCTG TATGTGGTGG 
ACACGCTGGC GATGCCGGAC ATACACCACC 
LacZ 


15 


3551 


ATGAAGCCAA TATTGAAACC CACGGCATGG TGCCAATGAA TCGTCTGACC 
TACTTCGGTT ATAACTTTGG GTGCCGTACC ACGGTTACTT AGCAGACTGG 

LacZ 


20 


3601 


GATGATCCGC 
CTACTAGGCG 


GCTGGCTACC 
CGACCGATGG 


GGCGATGAGC GAACGCGTAA CGCGAATGGT. 
CCGCTACTCG CTTGCGCATT ' GCGCTTACCA 
LacZ 


25 


3651 


GCAGPGPGAT 
CGTCGCGCTA 


rGTARTPAPP 

GCATTAGTGG 


PP.AGTP.TP.RT PATPTGP.TPP PTPPPP2V&Tr* 

GCTCACACTA GTAGACCAGC GACCCCTTAC 
LacZ 


30 


3701 


AATCAGGCCA 
TTAGTCCGGT 


CGGCGCTAAT 
GCCGCGATTA 


CACGACGCGC TGTATCGCTG GATCAAATCT 
GTGCTGCGCG ACATAGCGAC CTAGTTTAGA 
LacZ 


3751 


■ GTCGATCCTT 
CAGCTAGGAA 


CCCGCCCGGT 
GGGCGGGCCA 


GCAGTATGAA GGCGGCGGAG CCGACACCAC 
CGTCATACTT CCGCCGCCTC GGCTGTGGTG 
■ LacZ 


35 • 


3801 


GGCCACCGAT 
CCGGTGGCTA 


ATTATTTGCC 
TAATAAACGG 


CGATGTACGC GCGCGTGGAT GAAGACCAGC 
GCTACATGCG CGCGCACCTA CTTCTGGTCG 
LacZ 


40 


3051 


CCTTCCCGGC 
GGAAGGGCCG 


TGTGCCGAAA 
ACACGGCTTT 


TGGTCCATCA AAAAATGGCT TTCGCTACCT 
ACCAGGTAGT TTTTTACCGA AAGCGATGGA 
LacZ 


45 


. 3901- 


GGAGAGACGC 
CCTCTCTGCG 


GCCCGCTGAT 
CGGGCGACTA 


CCTTTGCGAA TACGCCCACG CGATGGGTAA 
GGAAACGCTT ATGCGGGTGC GCTACCCATT 
LacZ 


50 


3951 


CAGTCTTGGC 
GTCAGAACCG 


GGTTTCGCTA 
. CCAAAGCGAT 


AATACTGGCA GGCGTTTCGT CAGTATCCCC 
TTATGACCGT CCGCAAAGCA GTCATAGGGG 
LacZ 


4001 


GTT.TACAGGG 
CAAATGTCCC 


• CGGCTTCGTC 
GCCGAAGCAG 


TGGGACTGGG TGGATCAGTC GCTGATTAAA 
ACCCTGACCC ACCTAGTCAG CGACTAATTT 
LacZ 



55 4 051 TATGATGAAA ACGGCAACCC GTGGTCGGCT TACGGCGGTG ATTTTGGCGA 



ATACTACTTT TGCCGTTGGG CACCAGCCGA ATGCCGCCAC TAAAACCGCT 
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LacZ 



5 


4101 


TACGCCGAAC gatcgccagt tctgtatgaa cggtctggtc 

ATGCGGCTTG CTAGCGGTCA AGACATACTT GCCAGACCAG 

LacZ 


TTTGCCGACC 


10 


4151 


GCACGCCGCA 
CGTGCGGCGT 


TCCAGCGCTG ACGGAAGCAA AACACCAGCA 
AGGTCGCGAC TGCCTTCGTT TTGTGGTCGT 
LacZ 


GCAGTTTTT6 
CGTCAAAAAG 




4201 


CAGTTCCGTT 
GTCAAGGCAA 


TATCCGGGCA AACCATCGAA GTGACCAGCG 
ATAGGCCCGT TTGGTAGCTT CACTGGTCGC 
LacZ 


AATACCTGTT 
TTATGGACM 


15 


4251 ' 


CCGTCATAGC 
GGCAGTATCG 


GATAACGAGC TCCTGCACTG GATGGTGGCG 
CTATTGCTCG AGGACGTGAC CTACCACCGC 
LacZ 


CTGGATGGTA 
GACCTACCAT 


20 


4301 


AGCCGCTGGC 
TCGGCGACCG 


AAGCGGTGAA GTGCCTCTGG ATGTCGCTCC ACAAGGTAAA 
TTCGCCACTT CACGGAGACC TACAGCGAGG TGTTCCATTT 
LacZ 



4351 CAGTTGATTG AACTGCCTGA ACTACCGCAG CCGGAGAGCG CCGGGCAACT ' 
GTCAACTAAC TTGACGGACT TGATGGCGTC GGCCTCTCGC GGCCCGTTGA 

.LacZ 



30 


4401 


CTGGCTCACA 
GACCGAGTGT 


GTACGCGTAG TGCAACCGAA CGCGACCGCA TGGTCAGAAG 
CATGCGCATC ACGTTGGCTT GCGCTGGCGT ACCAGTCTTC 
LacZ 




4451 


CCGGGCACAT 
GGCCCGTGTA 


CAGCGCCTGG CAGCAGTGGC GTCTGGCGGA AAACCTCAGT 
GTCGCGGACC GTCGTCACCG CAGACCGCCT TTTGGAGTCA 
LacZ 


35 


4 501 


' GTGACGCTCC 
CACTGCGAGG 


CCGCCGCGTC CCACGCCATC CCGCATCTGA CCACCAGCGA 
GGCGGCGCAG GGTGCGGTAG GGCGTAGACT GGTGGTCGCT 
LacZ 


40 


4551 


AATGGATTTT 
TTACCTAAAA 


TGCATCGAGC TGGGTAATAA GCGTTGGCAA TTTAACCGCC 
ACGTAGCTCG ACCCATTATT CGCAACCGTT AAATTGGCGG 
LacZ 


45 


4601 


AGTCAGGCTT 
TCAGTCCGAA 


TCTTTCACAG ATGTGGATTG GCGATAAAAA ACAACTGCTG 
AGAAAGTGTC TACACCTAAC CGCTATTTTT TGTTGACGAC 
LacZ 


50 


4 651 


ACGCCGCTGC 
TGCGGCGACG 


GCGATCAGTT CACCCGTGCA CCGCTGGATA ACGACATTGG 
CGCTAGTCAA GTGGGCACGT GGCGACCTAT TGCTGTAACC 
LacZ 




4701 


CGTAAGTGAA 
GCATTCACTT 


GCGACCCGCA TTGACCCTAA CGCCTGGGTC GAACGCTGGA 
CGCTGGGCGT AACTGGGATT GCGGACCCAG CTTGCGACCT 
. LacZ 


55 ' 


47-51 


AGGCGGCGGG 
TCCGCCGCCC 


CCATTACCAG GCCGAAGCAG CGTTGTTGCA GTGCACGGCA 
GGTAATGGTC CGGCTTCGTC GCAACAACGT CACGTGCCGT 
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LacZ 

4801 GATACACTTG CTGAT.GCGGT GCTGATTACG ACCGCTCACG CGTGGCAGCA 
CTATGTGAAC GACTACGCCA CGACTAATGC TGGCGAGTGC GCACCGTCGT 

LacZ 

4B51 TCAGGGGAAA ACCTTATTTA TCAGCCGGAA AACCTACCGG ATTGATGGTA 
AGTCCCCTTT TGGAATAAAT AGTCGGCCTT TTGGATGGCC TAACTACCAT 

LacZ 

4 901 GTGGTCAAAT GGCGATTACC GTTGATGTTG AAGTGGCGAG CGATACACCG 
CACCAGTTTA CCGCTAATGG CAACTACAAC TTCACCGCTC GCTATGTGGC 

LacZ 

15 4 951 CATCCGGCGC GGATTGGCCT GAACTGCCAG CT.GGCGCAGG TAGCAGAGCG 
GTAGGCCGCG CCTAACCGGA CTTGACGGTC GACCGCGTCC ATCGTCTCGC 

LacZ 

5001 GGTAAACTGG CTCGGATTAG GGCCGCAAGA AAACTATCCC GACCGCCTTA 
20 CCATTTGACC GAGCCTAATC CCGGCGTTCT TTTGATAGGG CTGGCGGAAT 

LacZ 

5051 CTGCCGCCTG TTTTGACCGC TGGGATCTGC CATTGTCAGA CATGTATACC 
'GACGGCGGAC AAAACTGGCG ACCCTAGACG GTAACAGTCT GTACATATGG 
25 LacZ 

5101 CCGTACGTCT TCCCGAGCGA AAACGGTCTG CGCTGCGGGA CGCGCGAATT 
GGCATGCAGA AGGGCTCGCT TTTGCCAGAC GCGACGCCCT GCGCGCTTAA 

LacZ 



30 



5151 • GAATTATGGC CCACACCAGT GGCGCGGCGA CTTCCAGTTC AACATCAGCC 
CTTAATACCG GGTGTGGTCA CCGCGCCGCT GAAGGTCAAG TTGTAGTCGG 

LacZ 

35 5201 GGTACAGTCA ACAGCAATTG ATGGAAACCA GCCATTCGCC ATCTGCTGCA. 

CCATGTCAGT TGTCGTTAAC TACCTTTGGT CGGTAAGCGG TAGACGACGT 

LacZ 

525*1 CGCGGAAGAG GCACATGGCT GAATATCGAC GGTTTCCATA TGGGGATTGG 
40 GCGCCTTCTC .CGTGTACCGA CTTATAGCTG CCAAAGGTAT ACCCCTAACC 

LacZ 

5301 TGGCGACGAC TCCTGGAGCC CGTCAGTATC GGCGGAATTC CAGCTGAGCG 
ACCGCTGCTG AGGACCTCGG GCAGTCATAG CCGCCTTAAG GTCGACTCGC 
45 LacZ ■ 



5351 CCGGTCGCTA CCATTACCAG TTGGTCTGGT GTCAAAAATA ATAATAACCG 
GGCCAGCGAT GGTAATGGTC AACCAGACCA CAGTTTTTAT TATTATTGGC 
5401 GGCAGGGGGG ATCCGGAGCT TATCGCAGAT CAATTCGATA TCAAGCTTAT 
50 CCGTCCCCCC TAGGCCTCGA ATAGCGTCTA GTTAAGCTAT AGTTCGAATA 

H6 Promoter 



54 51 CGATACCGTC GACGGTATCG ATAAGCTCTA GTGGAGGGTT CTTTATTCTA 
GCTATGGCAG CTGCCATAGC TATTCGAGAT CACCTCCCAA GAAATAAGAT 
55 H6 Promoter 
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15 



35 



5501 TACTTAAAAA GTGAAAATAA ATACAAAGGT TCTTGAGGGT TGTGTTAAAT 
ATGAATTTTT CACTTTTATT TATGTTTCCA AGAACTCCCA ACACAATTTA 
H6 Promoter 

5551 TGAAAGCGAG AAATAATCAT AAATTATTTC ATTATCGCGA TATCCGTTAA 
ACTTTCGCTC TTTATTAGTA TTTAATAAAG TAATAGCGCT ATAGGCAATT 
H6 Promoter NYESO-1 



5601 GTTTGTATCG TACCCCCCCC GAGCCATGCA GGCCGAAGGC CGGGGCACAG 
CAAACATAGC ATGGGGGGGG CTCGGTACGT CCGGCTTCCG GCCCCGTGTC 
NYESO-1 

5651 GGGGTTCGAC GGGCGATGCT GATGGCCCAG GAGGCCCTGG CATTCCTGAT 
CCCCAAGCTG CCCGCTACGA CTACCGGGTC CTCCGGGACC GTAAGGACTA 
NYESO-1 



5701 GGCCCAGGGG GCAATGCTGG CGGCCCAGGA GAGGCGGGTG CCACGGGCGG 
CCGGGTCCCC CGTTACGACC GCCGGGTCCT CTCCGCCCAC GGTGCCCGCC 
• NYESO-1 
20 m ^ _____ 

. 5751 CAGAGGTCCC CGGGGCGCAG GGGCAGCAAG GGCCTCGGGG CCGGGAGGAG 
GTCTCCAGGG GCCCCGCGTC CCCGTCGTTC CCGGAGCCCC GGCCCTCCTC 
NYESO-1 

25 , 5801 GCGCCCCGCG GGGTCCGCAT GGCGGCGCGG CTTCAGGGCT GAATGGATGC 
CGCGGGGCGC CCCAGGCGTA CCGCCGCGCC GAAGTCCCGA CTTACCTACG 
NYESO-1 

5851 TGCAGATGCG GGGCCAGGGG GCCGGAGAGC CGCCTGCTTG AGTTCTACCT 
30 ACGTCTACGC CCCGGTCCCC CGGCCTCTCG GCGGACGAAC TCAAGATGGA 

NYESO-1 



5901 CGCCATGCCT TTCGCGACAC CCATGGAAGC AGAGCTGGCC CGCAGGAGCC 
GCGGTACGGA AAGCGCTGTG GGTACCTTCG TCTCGACCGG GCGTCCTCGG 
NYESO-1 



5951 TGGCCCAGGA TGCCCGACCG CTTCCCGTGC CAGGGGTGCT TCTGAAGGAG 
ACCGGGTCCT ACGGGGTGGC GAAGGGCACG GTCCCCACGA AGACTTCCTC 
NYESO-1 

40 ^ 

. .6001 TTCACTGTGT CCGGCAACAT ACTGACTATC CGACTGACTG CTGCAGACCA 
AAGTGACACA GGCCGTTGTA TGACTGATAG GCTGACTGAC GACGTCTGGT 
NYESO-1 

45 6051 CCGCCAACTG CAGCTCTCCA TCAGCTCCTG TCTCCAGCAG CTTTCCCTGT 
GGCGGTTGAC GTCGAGAGGT AGTCGAGGAC AGAGGTCGTC GAAAGGGACA 
NYESO-1 

6101 TGATGTGGAT CACGCAGGTG TTTCTGCCCG TGTTTTTGGC TCAGCCTCCC 
50 ACTACACCTA GTGCGTCCAC AAAGACGGGC ACAAAAACCG AGTCGGAGGG 

NYESO-1 



55 



6151 TCAGQGCAGA GGCGCTAAGT AATTAATTTT TTTTTGGGCT GCAGGATCGC 
AGTCCCGTCT CCGCGATTCA TTAATTAAAA AAAAACJCCGA CGTCCTAGCG 



24/53 

BNSDOC1D: <WO 2O05O2S37OA2J_» 



WO 2005/02637O 



PCT/US2004/028751 



sE/L Promoter • 



6201 


TAGCAAAAAT TGAAATTTTA TTTTTTTTTT TTGGAATATA AATAAGCTCG 
ATCGTTTTTA ACTTTAAAAT AAAAAAAAAA AACCTTATAT TTATTCGAGC 

hTRP-2 




sE/L Promoter 






6251 


AAGCTCGAGC 
TTCGAGCTCG 


CATGAGCCCC 
GTACTCGGGG 


CTTTGGTGGG GGTTTCTGCT 
GAAACCACCC CCAAAGACGA 
hTRP-2 


CAGTTGCTTG 
GTCAACGAAC 


6301 


GGCTGCAAAA 
CCGACGTTTT 


TCCTGCCAGG 
AGGACGGTCC 


AGCCCAGGGT CAGTTCCCCC 
TCGGGTCCCA GTCAAGGGGG 
hTRP-2 


GAGTCTGCAT 
CTCAGACGTA 


6351 


GACGGTGGAC 
CTGCCACCTG 


AGCCTAGTGA 
TCGGATCACT 


ACAAGGAGTG CTGCCCACGC 
TGTTCCTCAC GACGGGTGCG 
hTRP-2 


CTGGGTGCAG 
GACCCACGTC 


6101 


AGTCGGCCAA 
TCAGCCGGTT 


TGTCTGTGGC 
ACAGACACCG 


TCTCAGCAAG GCCGGGGGCA 
AGAGTCGTTC CGGCCCCCGT 
hTRP-2 


GTGCACAGAG 
CACGTGTCTC 



25 6451 GTGCGAGCCG ACACAAGGCC CTGGAGTGGT CCCTACATCC TACGAAACCA 
. CACGCTCGGC TGTGTTCCGG GACCTCACCA GGGATGTAGG ATGCTTTGGT 

' hTRP-2 



6501 GGATGACCGT GAGCTGTGGC CAAGAAAATT CTTCCACCGG ACCTGCAAGT 
30 CCTACTGGCA CTCGACACCG' GTTCTTTTAA GAAGGTGGCC TGGACGTTCA 

. hTRP-2 1 



35 


6551 


GCACAGGAAA 
CGTGTCCTTT 


CTTTGCCGGC 
GAAACGGCCG 


TATAATTGTG 
ATATTAACAC 
hTRP-2 


GAGACTGCAA 
CTCTGACGTT 


GTTTGGCTGG 
CAAACCGACC 


40 


6601 


ACCGGTCCCA 
TGGCCAGGGT 


ACTGCGAGCG 
TGACGCTCGC 


GAAGAAACCA 
CTTCTTTGGT 
hTRP-2 


CCAGTGATTC 
GGTCACTAAG 


GGCAGAACAT 
CCGTCTTGTA 


6651 


■CCATTCCTTG 
GGTAAGGAAC 


AGTCCTCAGG 
TCAGGAGTCC 


AAAGAGAGCA 
TTTCTCTCGT 
hTRP-2 


GTTCTTGGGC 
CAAGAACCCG 


GCCTTAGATC 
CGGAATCTAG 


45 


6701 


TCGCGAAGAA 
AGCGCTTCTT 


GAGAGTACAC 
CTCTCATGTG 


CCCGACTACG 
GGGCTGATGC 
hTRP-2 


TGATCACCAC 
ACTAGTGGTG 


ACAACACTGG 
TGTTGTGACC 


50 


6751 


CTGGGCCTGC 
GACCCGGACG 


TTGGGCCCAA 
AACCCGGGTT 


TGGAACCCAG 
ACCTTGGGTC 
hTRP-2 • 


CCGCAGTTTG 
GGCGTCAAAC 


CCAACTGCAG 
GGTTGACGTC 



6801 TGTTTATGAT TTCTTCGTGT GGCTCCATTA TTATTCTGTT AGAGATACAT 
ACAAATACTA AAGAAGCACA CCGAGGTAAT AATAAGACAA TCTCTATGTA 

55 
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hTRP-2 

6851 TATTAGGACC AGGACGCCCC TACAGGGCCA TAG A TTTCTC ^ACATC AAGG A 
ATAATCCTGG TCCTGCGGGG ATGTCCCGGT ATCTAAAGAG JS2^ 

hTRP-2 

6901 CCTGCATTTG TTACCTGGCA CCGGTACCAT ""tTGTTGTGTC ~TGGAAAG AGA 
GGACGTAAAC AATGGACCGT GGCCATGGTA AACAACACAG SwJJSS 

hTRP-2 

6951 TCTCCAGCGA CTCATTGGCA ATGAGTCTTT^TGCTTTGCCC^TACTRraarT 
AGAGGTCGCT GAGTAACCGT TACTCAGAAA ACGaIIcgS SStS 

hTRP-2 

7001 TTGCCACTGG GAGGAACGAG TGTGATGTdrGTACAGACCrGCTCTTOrrr 
AACGGTGACC CTCCTTGCTC ACACTACACA CATGTCTGGT SS 

hTRP-2 

7051 ^GCGAGAC CAGACGATCC GACTCTGAT7AOTCGGA^crCMGAWCrC 
CGTCGCTCTG GTCTGCTAGG CTGAGACTAA TCAGCCTTGA ' GTTCTAAGAG 

hTRP-2 

7101 CAGCTGGGAA ACTGTCTGTG ATAGCTTGGA "tgACTACAAC ~CACCTGGTr a 

gtcgaccctt tgacagacac tatcgaacct actgatg??g SSSS 

hTRP-2 

7151 ccttgtgcaa tggaacctat gmggtttgc~tgagaagaaa~tcaaatggga 
- ggaacacgtt accttggata cttccaaacg actcttcttt actJJSSS 

hTRP-2 

7201 i!^ AACAGCA TGAAATTGCC MCCTTAAAA^GAGATAC^G^ATTGCtTGTC 
J CTTTGTCGT ACTTTAACGG TTGGAATTTT CTGTATGCTC TAACGgS 

hTRP-2 ~ 

7251 I^™ GAAG T1 ' TGACAR, rc C1'CCCTTCTT~CCAGA^CTCT~ACCTTCA^ 
agaggtcttc aaactgttag gagggaagaa ggtcttgaga tggaagtcaI 

hTRP-2 

7301 IrTr™^ TTTGGAAGGG TTTGATAAAG^CAGATGGGAC^TCTGG^ 

AGTCCTTACG AAACCTTCCC AAACTATTTC GTCTACCCTG AGACCTAAGA 

hTRP-2 

7351 r^™™^* gccttcata a tttggttcat~tcct^cctga~acgggaca^a 

GTTCACTACT CGGAAGTATT AAACCAAGTA AGGAAGGACT TGCCCTGtS 

hTRP-2 

7401 CGCTTTGCCA CATTCAGCCG CCAATGWCrCATCTTCGTrCTGATTTCTA 

gcgaaacggt gtaagtcggc ggttactagg gtagaagcac cactaaaga? 

hTRP-2 

7451 atcgtttgct- ttacaatgct acaacaaaca~tccttgmca~tgtaagaam 
tagcaaacga aatgttacga tgttgtttgt aggaacttgt KnSJJi 

hTRP-2 

7501 GAGAAAGCGA CCAAGGAACT CCCTTCCCTG CATGTGCTGG^TTCTTCATTC 
CTCTTTCGCT GGTTCCTTGA GGGAAGGGAC GTACACGACC AAGAAGTAAG 
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hTRP-2 

7551 CTTTACTGAT GCCATCTTTG ATGAGTGGAT GAAAAGATTT AATCCTCCTG 
GAAATGACTA CGGTAGAAAC TACTCACCTA CTTTTCTAAA TTAGGAGGAC 
5 hTRP-2 ' 

7601 CAGATGCCTG GCCTCAGGAG CTGGCCCCTA TTGGTCACAA TCGGATGTAC 
GTCTACGGAC CGGAGTCCTC GACCGGGGAT AACCAGTGTT AGCCTACATG 

hTRP-2 

10 . 

7651 . AACATGGTTC CTTTCTTCCC TCCAGTGACT AATGAAGAAC TCTTTTTAAC 
TTGTACCAAG GAAAGAAGGG AGGTCACTGA TTACTTCTTG AGAAAAATTG 

hTRP-2 

15 7701 CTCAGACCAA CTTGGCTACA GCTATGCCAT CGATCTGCCA GTTTCAGTTG 
6AGTCTGGTT GAACCGATGT CGATACGGTA GCTAGACGGT CAAAGTCAAC 

hTRP-2 

7751 AAGAAACTCC AGGTTGGCCC ACAACTCTCT TAGTAGTCAT GGGAACACTG 
20 TTCTTTGAGG TCCAACCGGG TGTTGAGAGA ATCATCAGTA CCCTTGTGAC 

hTRP-2 

* 7801 GTGGCTTTGG TTGGTCTGTT CGTGCTGTTG GCTTTTCTTC. AATATAGAAG 
CACCGAAACC AACCAGACAA GCACGACAAC CGAAAAGAAG TTATATCTTC 
25 hTRP-2 • 

7851 ACTTCGAAAA GGATATACAC CCC.TAATGGA GACACATTTA AGCAGCAAGA 

TGAAGCTTTT CCTATATGTG GGGATTACCT CTGTGTAAAT TCGTCGTTCT 
hTRP-2 

30 . 

7901 GATACACAGA AGAAGCCTAG TTTTTTAATT AAGCATGCTC TAGAATCGAT 

CTATGTGTCT TCTTCGGATC AAAAAATTAA TTCGTACGAG ATCTTAGCTA 

C5 Left Arm 



35 7951 CCCGGGTTTT TATGACTAGT TAATCACGGC CGCTTATAAA GATCTAAAAT 
GGGCCCAAAA ATACTGATCA ATTAGTGCCG GCGAATATTT CTAGATTTTA 
C5 Left Arm 

8001 GCATAATTTC TAAATAATGA AAAAAAAGTA CATCATGAGC AACGCGTTAG'. 
^0 t CGTATTAAAG ATTTATTACT TTTTTTTCAT GTAGTACTCG TTGCGCAATC 

■ C5 Left Arm 

8051 TATATTTTAC AATGGAGATT AACGCTCTAT ACCGTTCTAT GTTTATTGAT 
ATATAAAATG TTACCTCTAA TTGCGAGATA TGGCAAGATA CAAATAACTA ' 
45 C5 Left Arm 

8101 TCAGATGATG TTTTAGAAAA GAAAGTTATT GAATATGAAA ACTTTAATGA 
AGTCTACTAC AAAATCTTTT CTTTCAATAA CTTATACTTT TGAAATTACT 
C5 Left Arm 

50 

8151 AGATGAAGAT GACGACGATG ATTATTGTTG TAAATCTGTT TTAGATGAAG 
TCTACTTCTA CTGCTGCTAC TAATAACAAC ATTTAGACAA AATCTACTTC 
C5. Left Arm 

55 ' 8201 AAGATGACGC GCTAAAGTAT ACTATGGTTA CAAAGTATAA GTCTATACTA 
TTCTACTGCG CGATTTCATA TGATACCAAT GTTTCATATT CAGATATGAT 
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C5 Left Arm 

6251 CTAATGGCGA CTTGTGCAAG AAGGTATAGT ATAGTGAAAA^TGTTGTTAGA 
GATTACCGCT GAACACGTTC TTCCATATCA TATCACTTTT ACAACAATCT 
C5 Left Arm 

0301 TTATGATTAT GAAAAACCAA ATAAATCAGA TCCATATCTa""aAGGTATCTC 
AATACTAATA CTTTTTGGTT TATTTAGTCT AGGTATAGAT TTCCATAGAG 
• C5 Left Arm 

8351 CTTTGCACAT AATTTCATCT ATTCC'f AGTT TAGAATACTt'tTCATTATAT 

■ ■ GAAACGTGTA ttaaagtaga taaggatcaa atcttatgaa aagtaatata 

C5 Left Arm 

84 oi ttgtttacag ctgaagacga aaaaaatata tcgataatag "aagattatgt 
aacaaatgtc gacttctgct ttttttatat agctattatc ttctaataca 

C5 Left Arm' 

84 51- TAACTCTGCT AATAAGATGA AATTGAATGA GTCTGTGACT~GCAGCCAAGC 
ATTGAGACGA TTATTCTACT TTAACTTACT CAGACACTGA CGTCGGTTCG 
' n'^^ CTGG CCGTCGTT ™ ACAACGTCGT GACTGGGAAA ACCCTGGCGT 

AACCGTGACC GGCAGCAAAA TGTTGCAGCA CTGACCCTTT TGGGACCGCA 
8551 TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCgS 

A'J'GGGTTGAA ttagcggaac gtcgtgtagg gggaaagcgg tcgaccgcat 

8601- ATAGCGAAGA GGCCCGCACC GATCGCCCTT CCCAACAGTT GCGCAGCCTC 
Pfiv. ' ™I^" TCT CCGGGCGTGG CTAGCGGGAA GGGTTGTCAA CGCGTCGGAC 
8651 AATGGCGAAT GGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG 

TTACCGCTTA CCGCGGACTA CGCCATAAAA GAGGAATGCG TAGACACGCC 
8701 TATTTCACAC CGCATATGGT GCACTCTCAG TACAATCTGC TCTGA^GCCG 
Rim ^^ GTGTG GCGTATACCA CGTGAGAGTC ATGTTAGACG AGACTACGGC 
8751 CATAGTTAAG CCAGCCCCGA CACCCGCCAA CACCCGCTGA CGCGCCCTGA 

GTATCAATTC GGTCGGGGCT GTGGGCGGTT GTGGGCGACT GCGCGGGACT 
0001 CGGGCTTGTC TGCTCCCGGC ATCCGCTTAC AGACAAGCTG TGACCGTCTC 
o„c, GCCCGAACAG ACGAGGGCCG. TAGGCGAATG TCTGTTCGAC ACTGGCAGAG 
0851 CGGGAGCTGC ATGTGTCAGA GGTTTTCACC GTCATCACCG AAACGCGCGA 
Dn n, ' GCCCTCGAC <3 TACACAGTCT CCAAAAGTGG CAGTAGTGGC TTTGCGCGCT 
B901 GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA 
anc , CTGCTTTCCC GGAGCACTAT GCGGATAAAA ATATCCAATT ACAGTACTAT 
8951 ATAATGGTTT CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG 

TATTACCAAA GAATCTGCAG TCCACCGTGA AAAGCCCCTT TACACGCGCC 
9001 AACCCCTATT TGTTTATTTT TCTAAATACA TTCAAATATG TATCCGCTCA 
ln „ " GGG S flTAA ACAAATAAAA AGATTTATGT AAGTTTATAC ATAGGCGAGT 
9051 TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT 
ACTCTGTTAT TGGGACTATT TACGAAGTTA TTATAACTTT TTCCTTCTCA 

Amp(R) 

9101 ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTC CCT^ TT ~TTGCGGCA TT 
TACTCATAAG TTGTAAAGGC ACAGCGGGAA TAAGGGAAAA AACGCCGTAA 

Amp(R) 

9151 TTGCCTTCCT- GTTTTTGCTC ACCCAGAAAC~GCTGGTGA^~GTAAAAGATG ' 
AACGGAAGGA CAAAAACGAG TGGGTCTTTG CGACCACTTT CATTTTCTAC 

Amp(R) 

9201 CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACt'gGATCTCAAC 
GACTTCTAGT CAACCCACGT GCTCACCCAA TGTAGCTTGA CCTAGAGTTG 
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Amp (R) 



5 




AGCGGTAAGA 
TCGCCATTCT 


TCCTTGAGAG 
AGGAACTCTC 


TTTTCGCCCC GAAGAACGTT TTCCAATGAT 
AAAAGCGGGG CTTCTTGCAA AAGGTTACTA 
Amp(R) ' 


10 




GAGCACTTTT 
CTCGTGAAAA 


AAAGTTCTGC 
TTTCAAGACG 


TATGTGGCGC 
ATACACCGCG 
Amp (R> 


GGTATTATCC 
CCATAATAGG 


CGTATTGACG 
GCATAACTGC 


Ji 


CCGGGCAAGA 
GGCCCGTTCT 


GCAACTCGGT 
CGTTGAGCCA 


CGCCGCATAC ACTATTCTCA 
GCGGCGTATG TGATAAGAGT 
Amp(R) 


GAATGACTTG 
CTTACTGAAC 


15 




GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 
CAACTCATGA GTGGTCAGTG TCTTTTCGTA GAATGCCTAC CGTACTGTCA 

Amp (R) 


20 




AAGAGAATTA 
TTCTCTTAAT 


TGCAGTGCTG 
ACGTCACGAC 


CCATAACCAT 
GGTATTGGTA 
Amp (R) 


GAGTGATAAC 
CTCACTATTG 


ACTGCGGCCA 
TGACGCCGGT 


25 


rDUJL- 


ACTTACTTCT 
TGAATGAAGA 


GACAACGATC 
CTGTTGCTAG 


GGAGGACCGA 
CCTCCTGGCT 
Amp (R) 


AGGAGCTAAC 
TCCTCGATTG 


CGCTTTTTTG 
GCGAAAAAAC 


30 


Q • 


CACAACATGG 
GTGTTGTACC 


GGGATCATGT 
CCCTAGTACA 


AACTCGCCTT 
TTGAGCGGAA 
Amp (R) 


GATCGTTGGG 
CTAGCAACCC 


AACCGGAGCT 
TTGGCCTCGA 


youi 


GAATGAAGCC 
CTTACTTCGG 


ATACCAAACG 
TATGGTTTGC 


ACGAGCGTGA 
TGCTCGCACT 
Amp (R) 


CACCACGATG 
GTGGTGCTAC 


CCTGTAGCAA 
GGACATCGTT 


35 


y dj x 


TGGCAACAAC 
ACCGTTGTTG 


GTTGCGCAAA 
CAACGCGTTT 


CTATTAACTG 
GATAATTGAC 
Amp (R) 


GCGAACTACT 
CGCTTGATGA 


TACTCTAGCT 
ATGAGATCGA 


40 


■ q*7 ni 


TCCCGGCAAC 
AGGGCCGTTG 


AATTAATAGA 
TTAATTATCT 


CTGGATGGAG 
•GACCTACCTC 
Amp (R) 


GCGGATAAAG 
CGCCTATTTC 


TTGCAGGACC. 
AACGTCCTGG 


45 


9751 


ACTTCTGCGC 
TGAAGACGCG 


TCGGCCCTTC 
AGCCGGGAAG 


CGGCTGGCTG 
GCCGACCGAC 
Amp (R) 


GTTTATTGCT 
CAAATAACGA 


GATAAATCTG 
CTATTTAGAC 


50 


9001 


GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 
CTCGGCCACT CGCACCCAGA GCGCCATAGT AACGTCGTGA CCCCGGTCTA 

Amp (R) 


9851 


GGTAAGCCCT 
CCATTCGGGA 


CCCGTATCGT 
GGGCATAGCA 


AGTTATCTAC 
TCAATAGATG 
Amp (R) 


ACGACGGGGA 
TGCTGCCCCT 


GTCAGGCAAC 
CAGTCCGTTG 


55 • 


9901 


TATGGATGAA 
ATACCTACTT 


CGAAATAGAC 
GCTTTATCTG 


AGATCGCTGA 
TCTAGCGACT 


GATAGGTGCC 
CTATCCACGG 


TCACTGATTA 
AGTGACTAAT 
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Amp{R) 



10 



15 



20 



9951 
10001 
10051 
10101 
10151 
10201 
10251 
10301 
10351 
10401 
10451 
25 10501 
10551 
10601 
10651 
10701 
10751 ' 
10801 
10851 
10901 
10951 
45 11 001 
11051 
11101 
11151 



30 



35 



40 



50 



AGCATTGGTA 
TCGTAACCAT 
TTAAAACTTC 
AATTTTGAAG 
TAATCTCATG 
ATTAGAGTAC 
CAGACCCCGT 
GTCTGGGGCA 
CGCGTAATCT 
GCGCATTAGA 
TTGTTTGCCG 
AACAAACGGC 
TCAGCAGAGC 
AGTCGTCTCG 
GGCCACCACT 
CCGGTGGTGA 
AATCCTGTTA 
TTAGGACAAT 
GGTTGGACTC 
CCAA'CCTGAG 
ACGGGGGGTT 
TGCCCCCCAA 
ACTGAGATAC 
TGACTCTATG 
GGAGAAAGGC 
CCTCTTTCCG 
CGCACGAGGG 
GCGTGCTCCC 
CGGGTTTCGC 
GCCCAAAGCG 
GGGGGCGGAG 
CCCCCGCCTC 
CTGGCCTTTT 
GACCGGAAAA 
TGATTCTGTG 
ACTAAGACAC 
GCCGCAGCCG 
CGGCGTCGGC 
GAGCGCCCAA 
CTCGCGGGTT 
ATGCAGCTGG 
TACGTCGACC 
ACGCAATTAA 
TGCGTTAATT 
TTTATGCTTC 
AAATACGAAG 
TCACACAGGA 
AGTGTGTCCT 
ATTCTAAG 



ACTGTCAGAC 

TGACAGTCTG 

ATTTTTAATT 

TAAAAATTAA 

ACCAAAATCC 

TGGTTTTAGG 

AGAAAAGATC 

TCTTTTCTAG 

GCTGCTTGCA 

CGACGAACGT 

GATCAAGAGC 

CTAGTTCTCG 

GCAGATACCA 

CGTCTATGGT 

TCAAGAACTC 

AGTTCTTGAG 

CCAGTGGCTG 

GGTCACCGAC 

AAGACGATAG 

TTCTGCTATC 

CGTGCACACA 

GCACGTGTGT 

CTACAGCGTG 

GATGTCGCAC 

GGACAGGTAT 

CCTGTCCATA 

AGCTTCCAGG 

TCGAAGGTCC 

CACCTCTGAC 

GTGGAGACTG 

CCTATGGAAA 

GGATACCTTT 

GCTGGCCTTT 

CGACCGGAAA 

GATAACCGTA 

CTATTGGCAT 

AACGACCGAG 

TTGCTGGCTC 

TACGCAAACC 

ATGCGTTTGG 

CACGACAGGT 

GTGCTGTCCA 

TGTGAGTTAG 

ACACTCAATC 

CGGCTCGTAT 

GCCGAGCATA 

AACAGCTATG 

TTGTCGATAC 



CAAGTTTACT 
GTTCAAATGA 
TAAAAGGATC 
ATTTTCCTAG 
CTTAACGTGA 
GAATTGCACT 
AAAGGATCTT 
TTTCCTAGAA 
AACAAAAAAA 
TTGTTTTTTT 
TACCAACTCT 
ATGGTTGAGA 
AATACTGTCC 
TTATGACAGG 
TGTAGCACCG 
ACATCGTGGC 
CTGCCAGTGG 
GACGGTCACC 
TTACCGGATA 
AATGGCCTAT 
GCCCAGCTTG 
CGGGTCGAAC 
AGCTATGAGA 
TCGATACTCT 
CCGGTAAGCG. 
GGCCATTCGC 
GGGAAACGCC 
CCCTTTGCGG 
TTGAGCGTCG 
AACTCGCAGC 
AACGCCAGCA 
TTGCGGTCGT 
TGCTCACATG 
ACGAGTGTAC 
TTACCGCCTT 
AATGGCGGAA 
CGCAGCGAGT 
GCGTCGCTCA 
GCCTCTCCCC 
CGGAGAGGGG 
TTCCCGACTG 
AAGGGCTGAC 
CTCACTCATT 
GAGTGAGTAA 
GTTGTGTGGA 
CAACACACCT 
ACCATGATTA 
TGGTACTAAT 



CATATATACT 
GTATATATGA 
TAGGTGAAGA 
ATCCACTTCT 
GTTTTCGTTC 
CAAAAGCAAG 
CTTGAGATCC 
GAACTCTAGG 
CCACCGCTAC 
GGTGGCGATG 
TTTTCCGAAG 
AAAAGGCTTC 
TTCTAGTGTA 
AAGATCACAT 
CCTACATACC 
GGATGTATGG 
CGATAAGTCG 
GCTATTCAGC 
AGGCGCAGCG 
TCCGCGTCGC 
GAGCGAACGA 
CTCGCTTGCT 
AAGCGCCACG 
TTCGCGGTGC. 
GCAGGGTCGG 
CGTCCCAGCC 
TGGTATCTTT 
ACCATAGAAA 
ATTTTTGTGA 
TAAAAACACT 
ACGCGGCCTT 
TGCGCCGGAA 
TTCTTTCCTG 
AAGAAAGGAC 
TGAGTGAGCT 
ACTCACTCGA 
CAGTGAGCGA 
GTCACTCGCT 
GCGCGTTGGC 
CGCGCAACCG 
GAAAGCGGGC 
CTTTCGCCCG 
AGGCACCCCA 
TCCGTGGGGT 
ATTGTGAGCG 
TAACACTCGC 
CGAATTGAAT 
GCTTAACTTA 



TTAGATTGAT 
AATCTAACTA 
TCCTTTTTGA 
AGGAAAAACT 
CACTGAGCGT 
GTGACTCGCA 
TTTTTTTCTG 
AAAAAAAGAC 
CAGCGGTGGT 
GTCGCCACCA 
GTAACTGGCT 
CATTGACCGA 
GCCGTAGTTA" 
CGGCATCAAT 
TCGCTCTGCT 
AGCGAGACGA 
TGTCTTACCG 
ACAGAATGGC 
CTCGGGCTGA. 
CAGCCCGACT 
CCTACACCGA 
GGATGTGGCT 
CTTCCCGAAG 
GAAGGGCTTC 
AACAGGAGAG 
TTGTCCTCTC 
ATAGTCCTGT 
TATCAGGACA 
TGCTCGTCAG 
ACGAGCAGTC 
TTTACGGTTC 
AAATGCCAAG 
CGTTATCCCC 
GCAATAGGGG 
GATACCGCTC 
CTATGGCGAG 
GGAAGCGGAA". 
CCTTCGCCTT 
CGATTCATTA 
•GCTAAGTAAT . 
AGTGAGCGCA 
TCACTCGCGT 
GGCTTTACAC 
CCGAAATGTG 
GATAACAATT 
CTATTGTTAA 
TGCGGCCGCA 
ACGCCGGCGT- 
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FIGURES 
DNA Sequence of donor plasmid pT3217 

C5 Right Arm 

1 TGAATGTTAA ATGTTATACT TTGGATGAAG CTATAAATAT GCATTGGAAA 
. ACTTACAATT TACAATATGA AAGCTACTTC GATATTTATA CGTAACCTTT 
C5 Right Arm 

!0 51 AATAATCCAT TTAAAGAAAG GATTCAAATA CTACAAAACC TAAGCGATAA 

' TTATTAGGTA AATTTCTTTC CTAAGTTTAT GATGTTTTGG ATTCGCTATT 
C5 Right Arm 

101 TATGTTAACT AAGCTTATTC TTAACGACGC TTTAAATATA CACAAATAAA 
15 ATACAATTGA TTCGAATAAG AATTGCTGCG AAATTTATAT GTGTTTATTT 

C5 Right Arm 

151 CATAATTTTT GTATAACCTA ACAAATAACT AAAACATAAA AATAATAAAA 
GTATTAAAAA CATATTGGAT TGTTTATTGA TTTTGTATTT TTATTATTTT 
20 C5 Right Arm 

'201 GGAAATGTAA TATCGTAATT ATTTTACTCA GGAATGGGGT TAAATATTTA 
CCTTTACATT ATAGCATTAA TAAAATGAGT CCTTACCCCA ATTTATAAAT 
C5 Right Arm 

25 

251 TATCACGTGT ATATCTATAC TGTTATCGTA TACTCTTTAC AATTACTATT 
ATAGTGCACA TATAGATATG ACAATAGCAT ATGAGAAATG TTAATGATAA 
C5 Right Arm 

30 301 ACGAATATGC AAGAGATAAT AAGATTACGT ATTTAAGAGA ATCTTGTCAT 

TGCTTATACG TTCTCTATTA TTCTAATGCA TAAATTCTCT TAGAACAGTA 
C5 Right Arm 

351 GATAATTGGG TACGACATAG TGATAAATGC TATTTCGCAT CGTTACATAA 
35 CTATTAACCC ATGCTGTATC ACTATTTACG ATAAAGCGTA GCAATGTATT 

C5 Right Arm 

4 01 AGTCAGTTGG AAAGATGGAT TTGACAGATG TAACTTAATA GGTGCAAAAA 
TCAGTCAACC TTTCTACCTA AACTGTCTAC ATTGAATTAT CCACGTTTTT 
•40 C5 Right Arm 

451 TGTTAAATAA CAGCATTCTA TCGGAAGATA GGATACCAGT TATATTATAC 
ACAATTTATT GTCGTAAGAT AGCCTTCTAT CCTATGGTCA ATATAATATG 
C5 Right Arm 

45 ~ - ... ... 

501 AAAAATCACT GGTTGGATAA AACAGATTCT GCAATATTCG TAAAAGATGA 
TTTTTAGTGA CCAACCTATT TTGTCTAAGA CGTTATAAGC ATTTTCTACT 
' C5 Right Arm 

50 551 AGATTACTGC GAATTTGTAA ACTATGACAA TAAAAAGCCA TTTATCTCAA 

TCTAATGACG CTTAAACATT TGATACTGTT ATTTTTCGGT AAATAGAGTT 
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C5 Right Arm 



5 


601 


CGACATCGTG 
GCTGTAGCAC 


TAATTCTTCC ATGTTTTATG 
ATTAAGAAGG TACAAAATAC 
C5 Right Arm 


TATQTGTTTC AGATATTATG 
ATACACAAAG TCTATAATAC 


.10 


651 


AGATTACTAT 
TCTAATGATA 


AAACTTTTTG TATACTTATA' TTCCGTAAAC TATATTAATC 
TTTGAAAAAC ATATGAATAT AAGGCATTTG ATATAATTAG 
■ C5 Right Arm 


• 701 


ATGAAGAAAA TGAAAAAGTA TAGAAGCTGT TCACGAGCGG TTGTTGAAAA 
TACTTCTTTT ACTTTTTCAT AT.CTTCGACA AGTGCTCGCC AACAACTTTT 
C5 Right Arm 


15 


751 


CAACAAAATT 
GTTGTTTTAA 


ATACATTCAA GATGGCTTAC 
TATGTAAGTT CTACCGAATG 
C5 Right Arm 


ATATACGTCT GTGAGGCTAT 
TATATGCAGA CACTCCGATA 


20" 


801 


CATGGATAAT 
GTACCTATTA 


GACAATGCAT CTCTAAATAG 
CTGTTACGTA GAGATTTATC 
C5 Right Arm 


GTTTTTGGAC AATGGATTCG 
CAAAAACCTG TTACCTAAGC 


25 


851 


ACCCTAACAC 
TGGGATTGTG 


GGAATATGGT ACTCTACAAT 
CCTTATACCA TGAGATGTTA 
. C5 Right Arm 


CTCCTCTTGA AATGGCTGTA 
GAGGAGAACT TTACCGACAT 


30 


901 


ATGTTCAAGA 
TACAAGTTCT 


ATACCGAGGC TATAAAAATC 
TATGGCTCCG ATATTTTTAG 
C5 Right Arm 


TTGATGAGGT ATGGAGCTAA 
AACTACTCCA TACCTCGATT 


951 


ACCTGTAGTT 
TGGACATCAA 


ACTGAATGCA CAACTTCTTG 
TGACTTACGT GTTGAAGAAC 
C5 Right Arm 


TCTGCATGAT GCGGTGTTGA 
AGACGTACTA CGCCACAACT 


35 


1001 


GAGACGACTA CAAAATAGTG AAAGATCTGT TGAAGAATAA CTATGTAAAC 
CTCTGCTGAT GTTTTATCAC TTTCTAGACA , ACTTCTTAT'T GATACATTTG 
C5 Right Arm 


40 


1051 


AATGTTCTTT 
TTACAAGAAA 


ACAGCGGAGG CTTTACTCCT 
TGTCGCCTCC GAAATGAGGA 
C5 Right Arm 


TTGTGTTTGG CAGCTTACCT 
AACACAAACC GTCGAATGGA 


45 


1101- 


TAACAAAGTT. 
ATTGTTTCAA 


AATTTGGTTA AACTTCTATT 
TTAAACCAAT TTGAAGATAA 
C5 Right Arm 


GGCTCATTCG GCGGATGTAG 
CCGAGTAAGC CGCCTACATC 


50 


1151 


ATATTTCAAA 
TATAAAGTTT 


CACGGATCGG TTAACTCCTC 
GTGCCTAGCC AATTGAGGAG 
C5 Right. Arm 


TACATATAGG CGTATCAAAT 
ATGTATATCG GCATAGTTTA 


■ 1201 


AAAAATTTAA - CAATGGTTAA ACTTCTATTG AACAAAGGTG CTGATACTGA 
TTTTTAAATT GTTACCAATT TGAAGATAAC TTGTTTCCAC GACTATGACT 
C5 Right Arm 



55 1251 • CTTGCTGGAT AACATGGGAT GTACTCCTTT AATGATCGCT GTACAATCTG 



GAACGACCTA TTGTACCCTA CATGAGGAAA TTACTAGCGA CATGTTAGAC 
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C5 Right Arm 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



1301 


GAAATATTGA AATATGTAGC ACACTACTTA AAAAAAATAA AATGTCCAGA 
CTTTATAACT TTATACATCG TGTGATGAAT TTTTTTTATT TTACAGGTCT 
C5 Right Arm 


1351 


ACTGGGAAAA ATTGATCTTG CCAGCTGTAA TTCATGGTAG AAAAGAAGTG 
TGACCCTTTT TAACTAGAAC GGTCGACATT AAGTACCATC TTTTCTTCAC 
C5 Right Arm 


1401 


CTCAGGCTAC TTTTCAACAA AGGAGCAGAT GTAAACTACA TCTTTGAAAG 
GAGTCCGATG AAAAGTTGTT TCCTCGTCTA CATTTGATGT AGAAACTTTC 
C5 Right Arm 


1451 


AAATGGAAAA TCATATACTG TTTTGGAATT GATTAAAGAA AGTTACTCTG 
TTTACCTTTT AGTATATGAC AAAACCTTAA CTAATTTCTT TCAATGAGAC 
C5 Right Arm 


1501 


AGACACAAAA 
TCTGTGTTTT 


GAGGTAGCTG AAGTGGTACT CTCAAAGGTA 
CTCCATCGAC TTCACCATGA GAGTTTCCAT 
Repeat Region 


CGTGACTAAT 
GCACTGATTA 


1551 


TAGCTATAAA 
ATCGATATTT 


AAGGATCGGG TTCTTTATTC TATACTTAAA 
TTCCTAGCCC AAGAAATAAG ATATGAATTT 
Repeat Region 


AAGTGAAAAT 
TTCACTTTTA 


1601 


AAATACAAAG 
TTTATGTTTC 


GTTCTTGAGG GTTGTGTTAA ATTGAAAGCG 
CAAGAACTCC CAACACAATT TAACTTTCGC 
Repeat Region 


AGAAATAATC 
TCTTTATTAG 


1651 


ATAAATTATT 
TATTTAATAA 


TCATTATCGC GATATCCGTT AAGTTTGTAT 
AGTAATAGCG CTATAGGCAA TTCAAACATA 
Repeat Region . 


CGTAATCTGC 
GCATTAGACG 


1701 


AGCCCCCACC ATGGATCTGG TGCTAAAAAG ATGCCTTCTT CATTTGGCTG 
TCGGGGGTGG TACCTAGACC ACGATTTTTC TACGGAAGAA GTAAACCGAC 
Repeat Region 


1751 


TGATAGGTGC 
ACTATCCACG 


TTTGCTGGCT GTGGGGGCTA CAAAAGTACC 
AAACGACCGA CACCCCCGAT GTTTTCATGG 
Repeat Region 


CAGAAACCAG. 
GTCTTTGGTC 


1801 


GACTGGCTTG 
CTGACCGAAC 


GTGTCTCAAG GCAACTCAGA ACCAAAGCCT 
CACAGAGTTC CGTTGAGTCT TGGTTTCGGA 
Repeat Region 


GGAACAGGCA 
CCTTGTCCGT 


1851 


GCTGTATCCA 
CGACATAGGT 


GAGTGGACAG AAGCCCAGAG ACTTGACTGC 
CTCACCTGTC TTCGGGTCTC TGAACTGACG 
Repeat Region 


TGGAGAGGTG 
ACCTCTCCAC 


1901 


GTCAAGTGTC 
CAGTTCACAG 


CCTCAAGGTC AGTAATGATG GGCCTACACT 
GGAGTTCCAG TCATTACTAC CCGGATGTGA 
Repeat Region 


GATTGGTGCA 
CTAACCACGT 



1951 



AATGCCTCCT TCTCTATTGC CTTGAACTTC CCTGGAAGCC AAAAGGTATT 
TTACGGAGGA AGAGATAACG GAACTTGAAG GGACCTTCGG TTTTCCATAA 
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Repeat Region C1B promoter 



2001 GCCAGATACT AGTTCTAGAG GATCATTATT TAACGTAAAC TAAATGGAAA 
CGGTCTATGA TCAAGATCTC CTAGTAATAA ATTGCATTTG ATTTACCTTT 
C1B promoter 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2051 • 


AGCTATTTAC AGGTACATAC GGTGTTTTTC TGGAATCAAA 
TCGATAAATG TCCATGTATG CCACAAAAAG ACCTTAGTTT 
• C1B promoter 


TGATTCTGAT 
ACTAAGACTA 


2101 


TTTGAGGATT TTATCAATAC AATAATGACA GTGCTAACTG 
AAACTCCTAA AATAGTTATG TTATTACTGT CACGATTGAC 
C1B promoter 


GTAAAAAAGA 
CATTTTTTCT 


2151 


AAGCAAACAA TTATCATGGC TAACAATTTT TATTATATTT GTAGTATGCA 
TTCGTTTGTT AATAGTACCG ATTGTTAAAA ATAATATAAA CATCATACGT 
C1B promoter 


2201 


TAGTGGTCTT TACGTTTCTT TATTTAAAGT TAATGTGTTA AGATTAAATG 
ATCACCAGAA ATGCAAAGAA ATAAATTTCA ATTACACAAT' TCTAATTTAC 
C1B promoter LacZ 


2251 


. GAGTAATTGG ATCCCCCATC GATGGGGAAT TCACTGGCCG 
" CTCATTAACC TAGGGGGTAG CTACCCCTTA AGTGACCGGC 

LacZ 


TCGTTTTACA 
AGCAAAATGT 


2301 


ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 
TGCAGCACTG ACCCTTTTGG GACCGCAATG GGTTGAATTA 

LacZ 


CGCCTTGCAG 
GCGGAACGTC 


2351 


CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 
GTGTAGGGGG AAAGCGGTCG ACCGCATTAT CGCTTCTCCG GGCGTGGCTA 

LacZ 


2401 


CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGGC 
GCGGGAAGGG TTGTCAACGC GTCGGACTTA CCGCTTACCiG 

LacZ 


GCTTTGCCTG 
CGAAACGGAC 


24 51 


GTTTCCGGCA CCAGAAGCGG TGCCGGAAAG CTGGCTGGAG 
'CAAAGGCCGT GGTCTTCGCC ACGGCCTTTC GACCGACCTC 

LacZ 


TGCGATCTTC 
ACGCTAGAAG 


2501 


CTGAGGCCGA TACTGTCGTC GTCCCCTCAA ACTGGCAGAT 
GACTCCGGCT ATGACAGCAG CAGGGGAGTT TGACCGTCTA 

LacZ 


GCACGGTTAC 
CGTGCCAATG 


2551 


GATGCGCCCA TCTACACCAA CGTGACCTAT CCCATTAtGG TCAATCCGCC 
ctac'gcgggt. AGATGTGGTT gcactggata GGGTAATGCC AGTTAGGCGG 

LacZ. 


2601 


GTTTGTTCCC ACGGAGAATC CGACGGGTTG TTACTCGCTC 
CAAACAAGGG TGCCTCTTAG GCTGCCCAAC AATGAGCGAG 

LacZ 


ACATT7AATG 
TGTAAATTAC 


2651 


TTGATGAAAG CTGGCTACAG GAAGGCCAGA CGCGAATTAT 
AACTACT.TTC GACCGATGTC CTTCCGGTCT GCGCTTAATA 


TTTTGATGGC 
AAAACTACCG 
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LacZ 



5 


2701 


GTTAACTCGG 
CAATTGAGCC 


CGTTTCATCT 
GCAAAGTAGA 


GTGGTGCAAC 
CACCACGTTG 
LacZ 


GGGCGCTGGG 
CCCGCGACCC 


TCGGTTACGG 
AGCCAATGCC 


JO 


2751 


CCAGGACAGT 
GGTCCTGTCA 


CGTTTGCCGT 
GCAAACGGCA 


CTGAATTTGA 
GACTTAAACT 
LacZ 


CCTGAGCGCA 
GGACTCGCGT 


TTTTTACGCG 
AAAAATGCGC 


2801 


CCGGAGAAAA CCGCCTCGCG GTGATGGTGC TGCGCTGGAG TGACGGCAGT 
GGCCTCTTTT GGCGGAGCGC CACTACCACG ACGCGACCTC ACTGCCGTCA 

LacZ 


15 


2051 


TATCTGGAAG 
ATAGACCTTC 


ATCAGGATAT 
TAGTCCTATA 


GTGGCGGATG 
CACCGCCTAC 
LacZ 


AGCGGCATTT 
TCGCCGTAAA 


TCCGTGACGT 
AGGCACTGCA 


20 


2901 


' CTCGTTGCTG 
GAGCAACGAC 


CATAAACCGA 
GTATTTGGCT 


CTACACAAAT CAGCGATTTC 
GATGTGTTTA GTCGCTAAAG 
LacZ 


CATGTTGCCA 
' GTACAACGGT 


'25 


2951 


CTCGCTTTAA 
GAGCGAAATT 


TGATGATTTC 
ACTACTAAAG 


AGCCGCGCTG 
TCGGCGCGAC 
LacZ 


TACTGGAGGC 
ATGACCTCCG 


TGAAGTTCAG 
ACTTCAAGTC 


30 


. 3001 


ATGTGCGGCG 
TACACGCCGC 


AGTTGCGTGA 
TCAACGCACT 


CTACCTACGG 
GATGGATGCC 
LacZ 


GTAACAGTTT. CTTTATGGCA 
CATTGTCAAA GAAATACCGT 


3051 


■ GGGTGAAACG 
CCCACTTTGC 


CAGGTCGCCA 
GTCCAGCGGT 


GCGGCACCGC 
CGCCGTGGCG 
LacZ 


GCCTTTCGGC 
CGGAAAGCCG 


GGTGAAATTA 
CCACTTTAAT 


35 


.3101 


TCGATGAGCG TGGTGGTTAT GCCGATCGCG TCACACTACG TCTGAACGTC 
AGCTACTCGC ACCACCAATA CGGCTAGCGC AGTGTGATG'C AGACTTGCAG 

LacZ 


40 


' 3151 


GAAAACCCGA 
CTTTTGGGCT 


AACTGTGGAG 
TTGACACCTC 


CGCCGAAATC 
GCGGCTTTAG 
LacZ 


CCGAATCTCT 
GGCTTAGAGA 


ATCGTGCGGT 
TAGCACGCCA 


45 




GGTTGAACTG 
CCAACTTGAC 


CACACCGCCG 
GTGTGGCGGC 


ACGGCACGCT 
TGCCGTGCGA 
LacZ 


GATTGAAGCA 
CTAACTTCGT 


GAAGCCTGCG 
CTTCGGACGC 


50 


3251 


ATGTCGGTTT 
TACAGCCAAA 


CCGCGAGGTG 
GGCGCTCCAC 


CGGATTGAAA 
GCCTAACTTT 
LacZ 


ATGGTCTGCT 
TACCAGACGA 


GCTGCTGAAC 
CGACGACTTG 


3301 


GGCAAGCCGT' TGCTGATTCG AGGCGTTAAC 
CCGTTCGGCA ACGACTAAGC TCCGCAATTG 

LacZ 


CGTCACGAGC ATCATCCTCT 
GCAGTGCTCG TAGTAGGAGA 


55 


3351 


GCATGGTCAG 
CGTACCAGTC 


GTCATGGATG 
CAGTACCTAC 


AGCAGACGAT 
TCGTCTGCTA 


GGTGCAGGAT 
CCACGTCCTA 


ATCCTGCTGA 
TAGGACGACT 
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LacZ 

3401 TGAAGCAGAA CAACTTTAAC GCCGTGCGCT GTTCGCATTA TCCGAACCAT 
ACTTCGTCTT GTTGAAATTG CGGCACGCGA CAAGCGTAAT AGGCTTGGTA ' 

LacZ 



34 51 CCGCTGTGGT ACACGCTGTG CGACCGCTAC GGCCTGTATG TGGTGGATGA 
GGCGACACCA TGTGCGACAC GCTGGCGATG CCGGACATAC ACCACCTACT 

LacZ 



3501 AGCCAATATT GAAACCCACG GCATGGTGCC AATGAATCGT CTGACCGATG 
TCGGTTATAA CTTTGGGTGC CGTACCACGG TTACTTAGCA GACTGGCTAC 

LacZ 



3551 ATCCGCGCTG GCTACCGGCG ATGAGCGAAC GCGTAACGCG AATGG.TGCAG 
TAGGCGCGAC CGATGGCCGC TACTCGCTTG CGCATTGCGC TTACCACGTC 

LacZ 



3601 CGCGATCGTA ATCACCCGAG TGTGATCATC TGGTCGCTGG GGAATGAATC 
GCGCTAGCAT TAGTGGGCTC ACACTAGTAG ACCAGCGACC CCTTACTTAG 

LacZ 



3651 AGGCCACGGC GCTAATCACG ACGCGCTGTA TCGCTGGATC AAATCTGTCG 
TCCGGTGCCG CGATTAGTGC TGCGCGACAT AGCGACCTAG TTTAGACAGC 

LacZ 



3701 ATCCTTCCCG CCCGGTGCAG TATGAAGGCG GCGGAGCCGA CACCACGGCC 
TAGGAAGGGC GGGCCACGTC ATACTTCCGC CGCCTCGGCT GTGGTGCCGG 

LacZ 



3751 ACCGATATTA TTTGCCCGAT GTACGCGCGC GTGGATGAAG ACCAGCCCTT 
TGGCTATAAT AAACGGGCTA CATGCGCGCG CACCTACTTC TGGTCGGGAA 

LacZ 



3001 CCCGGCTGTG CCGAAATGGT CCATCAAAAA ATGGCTTTCG CTACCTGGAG 
GGGCCGACAC GGCTTTACCA GGTAGTTTTT TACCGAAAGC GATGGACCTC ■ 

LacZ 



3851 AGACGCGCCC GCTGATCCTT TGCGAATACG CCCACGCGAT GGGTAACAGT. 

TCTGCGCGGG CGACTAGGAA ACGCTTATGC GGGTGCGCTA CCCATTGTCA 

LacZ 



3901 CTTGGCGGTT TCGCTAAATA CTGGCAGGCG TTTCGTCAGT ATCCCCGTTT . 

GAACCGCCAA AGCGATTTAT GACCGTCCGC AAAGCAGTCA TAGGGGCAAA ' 

LacZ 



3951 ACAGGGCGGC TTCGTCTGGG ACTGGGTGGA TCAGTCGCTG ATTAAATATG 
TGTCCCGCCG AAGCAGACCC TGACCCACCT AGTCAGCGAC TAATTTATAC 

LacZ 



4 001 ATGAAAACGG CAACCCGTGG TCGGCTTACG GCGGTGATTT TGGCGATACG 
TACTTTTGCC GTTGGGCACC AGCCGAATGC CGCCACTAAA ACCGCTATGC 

LacZ 



4 051 CCGAACGATC GCCAGTTCTG TATGAACGGT CTGGTCTTTG CCGACCGCAC 
GGCTTGCTAG CGGTCAAGAC ATACTTGCCA GACCAGAAAC GGCTGGCGTG 
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LacZ 



.10 



15 



4101 



.4151 



4201 



4251 



GCCGCATCCA GCGCTGACGG AAGCAAAACA CCAGCAGCAG TTTTTCCAGT 
CGGCGTAGGT CGCGACTGCC TTCGTTTTGT GGTCGTCGTC AAAAAGGTCA 

LacZ 

TCCGTTTATC CGGGCAAACC ATCGAAGTGA CCAGCGAATA CCTGTTCCGT 
AGGCAAATAG GCCCGTTTGG TAGCTTCACT GGTCGCTTAT GGACAAGGCA 

LacZ 

CATAGCGATA ACGAGCTCCT GCACTGGATG GTGGCGCTGG ATGGTAAGCC 
GTATCGCTAT TGCTCGAGGA CGTGACCTAC. CACCGCGACC TACCATTCGG 

LacZ 

GCTGGCAAGC GGTGAAGTGC CTCTGGATGT CGCTCCACAA GGTAAACAGT 
CGACCGTTCG CCACTTCACG GAGACCTACA GCGAGGTGTT CCATTTGTCA 

LacZ 



20 



25 



4301 



4351 



TGATTGAACT GCCTGAACTA CCGCAGCCGG AGAGCGCCGG GCAACTCTGG 
ACTAACTTGA CGGACTTGAT GGCGTCGGCC TCTCGCGGCC CGTTGAGACC 

LacZ 

CTCACAGTAC GCGTAGTGCA ACCGAACGCG ACCGCATGGT CAGAAGCCGG 
GAGTGTCATG CGCATCACGT TGGCTTGCGC TGGCGTACCA GTCTTCGGCC 

LacZ 



30 



35 



4401 



4451 



4501 



GCACATCAGC GCCTGGCAGC AGTGGCGTCT GGCGGAAAAC CTCACTGTGA 
CGTGTAGTCG CGGACCGTCG TCACCGCAGA CCGCCTTTTG GAGTCACACT 

LacZ 

CGCTCCCCGC CGCGTCCCAC GCCATCCCGC ATCTGACCAC CAGCGAAATG 
GCGAGGGGCG GCGCAGGGTG CGGTAGGGCG TAGACTGGTG GTCGCTTTAC 

LacZ 

GATTTTTGCA TCGAGCTGGG TAATAAGCGT TGGCAATTTA ACCGCCAGTC 
CTAAAAACGT AGCTCGACCC ATTATTCGCA ACCGTTAAAT TGGCGGTCAG 

LacZ 



40 



4551 



AGGCTTTCTT TCACAGATGT GGATTGGCGA TAAAAAACAA CTGCTGACGC 
TCCGAAAGAA AGTGTCTACA CCTAACCGCT ATTTTTTGTT GACGACTGCG 

LacZ 



45 



50 



55 



4 601 



4651 



4701 



4751 



CGCTGCGCGA TCAGTTCACC CGTGCACCGC TGGATAACGA CATTGGCGTA 
GCGACGCGCT AGTCAAGTGG GCACGTGGCG ACCTATTGCT GTAACCGCAT 

LacZ 

AGTGAAGCGA CCCGCATTGA CCCTAACGCC TGGGTCGAAC GCTGGAAGGC 
TCACTTCGCT GGGCGTAACT GGGATTGCGG ACCCAGCTTG CGACCTTCCG 

LacZ. 

GGCGGGCCAT TACCAGGCCG AAGCAGCGTT GTTGCAGTGC ACGGCAGATA 
CCGCCCGGTA ATGGTCCGGC TTCGTCGCAA CAACGTCACG TGCCGTCTAT 

LacZ 

CACTTGCTGA TGCGGTGCTG ATTACGACCG CTCACGCGTG GCAGCATCAG 
GTGAACGACT ACGCCACGAC TAATGCTGGC GAGTGCGCAC CGTCGTAGTC 
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LacZ 




5 


4801 


GGGAAAACCT TATTTATCAG 
CCCTTTTGGA ATAAATAGTC 


CCGGAAAACC TACCGGATTG 
GGCCTTTTGG ATGGCCTAAC 
LacZ 


ATGGTAGTGG 
TACCATCACC 


10 


4851 


TCAAATGGCG 
AGTTTACCGC 


ATTACCGTTG 
TAATGGCAAC 


ATGTTGAAGT GGCGAGCGAT ACACCGCATC 
TACAACTTCA CCGCTCGCTA TGTGGCGTAG 
LacZ 


4901 


. CGGCGCGGAT TGGCCTGAAC TGCCAGCTGG CGCAGGTAGC 
GCCGCGCCTA ACCGGACTTG ACGGTCGACC GCGTCCATCG 

LacZ 


AGAGCGGGTA 
TCTCGCCCAT 


15 


4951' 


AACTGGCTCG 
TTGACCGAGC 


GATTAGGGCC 
CTAATCCCGG 


•GCAAGAAAAC TATCCCGACC 
CGTTCTTTTG ATAGGGCTGG 
LacZ 


GCCTTACTGC 
CGGAATGACG 


20 . 


5001 


CGCCTGTTTT 
GCGGACAAAA 


GACCGCTGGG 
CTGGCGACCC 


ATCTGCCATT GTCAGACATG 
TAGACGGTAA CAGTCTGTAC 
LacZ 


TATACCCCGT 
ATATGGGGCA 


25 


5051 


ACGTCTTCCC 
TGCAGAAGGG 


GAGCGAAAAC 
CTCGCTTTTG 


GGTCTGCGCT GCGGGACGCG 
CCAGACGCGA CGCCCTGCGC 
LacZ 


CGAATTGAAT 
GCTTAACTTA 


30 


5101 


TATGGCCCAC ACCAGTGGCG CGGCGACTTC CAGTTCAACA 
ATACCGGGTG TGGTCACCGC GCCGCTGAAG GTCAAGTTGT 

LacZ 


TCAGCCGGTA 
AGTCGGCCAT 


5151 


CAGTCAACAG 
GTCAGTTGTC 


CAATTGATGG 
GTTAACTACC 


AAACCAGCCA TTCGCCATCT 
TTTGGTCGGT AAGCGGTAGA 
LacZ 


GCTGCACGCG 
CGACGTGCGC 


35 


5201 • 


GAAGAGGCAC 
CTTCTCCGTG 


ATGGCTGAAT 
TACCGACTTA 


ATCGACGGTT TCCATATGGG 
TAGCTGCCAA AGGTATACCC 
LacZ 


GATTGGTGGC 
CTAACGACCG 


40 


5251 


GACGACTCCT 
CTGCTGAGGA 


GGAGCCCGTC 
CCTCGGGCAG 
. LacZ 


AGTATCGGCG GAATTCCAGC TGAGCGCCGG. 
TCATAGCCGC CTTAAGGTCG ACTCGCGGCC 



5301 TCGCTACCAT TACCAGTTGG TCTGGTGTCA AAAATAATAA TAACCGGGCA 
AGCGATGGTA ATGGTCAACC AGACCACAGT TTTTATTATT ATTGGCCCGT 
4'5 5351 GGGGGGATCC GGAGCTTATC GCAGATCAAT TCGATATCAA GCTTATCGAT 
CCCCCCTAGG CCTCGAATAG CGTCTAGTTA AGCTATAGTT CGAATAGCTA 

H6 Promoter 



54 01 ACCGTCGACC TCGAGTCTAG AATCGATCCC GGGTTCTTTA TTCTATACTT 
50 TGGCAGCTGG AGCTCAGATC TTAGCTAGGG CCCAAGAAAT AAGATATGAA 

H6 Promoter 



55 ■ 



5451 AAAAAGTGAA AATAAATACA AAGGTTCTTG AGGGTTGTGT TAAATTGAAA 
TTTTTCACTT ttATTTATGT TTCCAAGAAC TCCCAACACA ATTTAACTTT 
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.10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



H6 Promoter 

5501 GCGAGAAATA ATCATAAATT ATTTCATTAT. CGCGATATCC GTTAAGTTTG 
CGCTCTTTAT TAGTATTTAA TAAAGTAATA GCGCTATAGG CAATTCAAAC 
H6 Promoter gplOO(.M) 

5551 TATCGTMTC TGCAGCCCCC ACCATGGATC TGGTGCTAM^MGATGCCTT 
ATAGCATTAG ACGTCGGGGG TGGTACCTAG ACCACGATTT TTCTACGGAA 
gplOO(M) 

5601 CTTCATTTGG CTGTGATAGG TGCTTTGCTG GCTGTGGGGG ^CTACAAAAGT 
GAAGTAAACC GACACTATCC ACGAAACGAC CGACACCCCC GATGTTTTCA 
gpl00(M) 

5651 ACCCAGAAAC CAGGACTGGC TTGGTGTCTC AAGGCAACTC AGAACCAAAG 
TGGGTCTTTG GTCCTGACCG AACCACAGAG TTCCGTTGAG TCTTGGTTTC 
gplOO(M) 

.5701 CCTGGAACAG GCAGCTGTAT CCAGAGTGGA CAGAAGCCCA ^GAGACTTGAC 
GGACCTTGTC CGTCGACATA GGTCTCACCT GTCTTCGGGT CTCTGAACTG 
gplOO(M) 

5751 TGCTGGAGAG GTGGTCAAGT GTCCCTCAAG GT^GT^TG^ATGGGCCTAC 
ACGACCTCTC CACCAGTTCA CAGGGAGTTC CAGTCATTAC TACCCGGATG 
gplOO(M) 

5001 ACTGATTGGT GCAAATGCCT CCTTCTCTAT TGCCTTGAAC ^TTCCC TGG AA 
TGACTAACCA CGTTTACGGA GGAAGAGATA ACGGAACTTG AAGGGACCTT ' 
gplOO(M) 

5851 ■ GCCAAAAGGT ATTGCCAGAT GGGCAGGTTA TCTGGGTCAA^CAATACCATC 
CGGTTTTCCA TAACGGTCTA CCCGTCCAAT AGACCCAGTT GTTATGGTAG 
gplOO(M) 

5901 ATCAATGGGA GCCAGGTGTG GGGAGGACAG CCAGTGTATC^CCCAGGAAAC 
TAGTTACCCT CGGTCCACAC CCCTCCTGTC GGTCACATAG GGGTCCTTTG 
. gplOO(M) • 

5951 . TGACGATGCC TGCATCTTCC CTGATGGTGG ACCTTGCCCA^TCTGGCTCTT 
ACTGCTACGG ACGTAGAAGG GACTACCACC TGGAACGGGT AGACCGAGAA 
gplOO(M) 

6001 GGTCTCAGAA GAGAAGCTTT GTTTATGTCT GG AAG ACCTG ^GGGCC AATAC 
CCAGAGTCTT CTCTTCGAAA CAAATACAGA CCTTCTGGAC CCCGGTTATG 
gplOO(M) 

6051 TGGCAAGTTC TAGGGGGCCC AGTGTCTGGG^CTGAGCATTG^GGACAGGCAG 
ACCGTTCAAG ATCCCCCGGG TCACAGACCC GACTCGTAAC CCTGTCCGTC 
gplOO(M) , 

6101 GGCAATGCTCT GGCACACACA CGATGGAAGT GACTGTCTAC^CATCGCCGGG ■ 
CCGTTACGAC CCGTGTGTGT GCTACCTTCA CTGACAGATG GTAGCGGCCC 
gplOO(M) 

6151- GATCCCGGAG CTATGTGCCT CTTGCTCATT CCAGCTCAGC^CTTCACCATT 
CTAGGGCCTC GATACACGGA GAACGAGTAA GGTCGAGTCG GAAGTGGTAA 
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gplOO(M) 



6201 



ATGGACCAGG 
TACCTGGTCC 



TGCCTTTCTC CGTGAGCGTG 
ACGGAAAGAG GCACTCGCAC 
gplOO(M) 



TCCCAGTTGC GGGCCTTGGA 
AGGGTCAACG CCCGGAACCT 



10 



6251 



6301 



TGGAGGGAAC 
ACCTCCCTTG 



AAGCACTTCC TGAGAAATCA 
TTCGTGAAGG ACTCTTTAGT 
gplOO(M) 



GCCTCTGACC TTTGCCCTCC 
CGGAGACTGG AAACGGGAGG 



AGCTCCATGA 
TCGAGGTACT 



CCCCAGTGGC TATCTGGCTG 
GGGGTCACCG ATAGACCGAC 
gplOO(M) 



AAGCTGACCT CTCCTACACC 
TTCGACTGGA GAGGATGTGG 



15 



6351 



TGGGACTTTG 
ACCCTGAAAC 



GAGACAGTAG TGGAACCCTG 
CTCTGTCATC ACCTTGGGAC 
gplOQ(M) 



ATCTCTCGGG CACTTGTGGT 
TAGAGAGCCC GTGAACACCA 



20 



6601 



CACTCATACT 
GTGAGTATGA 



TACCTGGAGC CTGGCCCAGT 
ATGGACCTCG GACCGGGTCA 
gplOO(M) 



CACTGTTCAG GTGGTCCTGC 
GTGACAAGTC CACCAGGACG 



25 



6451 



AGGCTGCCAT 
TCCGACGGTA 



TCCTCTCACC TCCTGTGGCT 
AGGAGAGTGG AGGACACCGA 
gplOO(M) 



CCTCCCCAGT TCCAGGCAOC 
GGAGGGGTCA AGGTCCGTGG 



30 



6501 



6551 



ACAGATGGGC 
TGTCTACCCG 



ACAGGCCAAC TGCAGAGGCC 
TGTCCGGTTG ACGTCTCCGG 
gplOO(M) 



CCTAACACCA CAGCTGGCCA 
GGATTGTGGT GTCGACCGGT 



AGTGCCTACT 
TCACGGATGA 



ACAGAAGTTG TGGGTACTAC 
TGTCTTCAAC ACCCATGATG 
gplOO(M) 



ACCTGGTCAG GCGCCAACTG 
TGGACCAGTC CGCGGTTGAC 



35 



6601 



CAGAGCCCTC 
GTCTCGGGAG 



TGGAACCACA TCTGTGCAGG 
ACCTTGGTGT AGACACGTCC 
gplOO(M) 



TGCCAACCAC TGAAGTCATA 
ACGGTTGGTG ACTTCAGTAT 



40 



6651 



AGCACTGCAC 
TCGTGACGTG 



CTGTGCAGAT GCCAACTGCA 
GACACGTCTA CGGTTGACGT 
gplOO(M) 



GAGAGCACAG GTATGACACC. 
CTCTCGTGTC CATACTGTGG 



45 



6701 



TGAGAAGGTG 
ACTCTTCCAC 



CCAGTTTCAG AGGTCATGGG 
GGTCAAAGTC TCCAGTACCC 
gplOO(M) 



TACCACACTG GCAGAGATGT 
ATGGTGTGAC CGTCTCTACA 



50 



6751 



6801 



CAACTCCAGA 
GTTGAGGTCT 



GGCTACAGGT ATGACACCTG 
CCGATGTCCA TACTGTGGAC 
gplOO(M) 



CAGAGGTATC AATTGTGGTG 
GTCTCCATAG TTAACACCAC 



CTTTCTGGAA CCACAGCTGC ACAGGTAACA 
GAAAGACCTT GGTGTCGACG TGTCCATTGT 
gplOO(M) 



ACTACAGAGT GGGTGGAGAC 
TGATGTCTCA CCCACCTCTG 



55 



6851 



CACAGCTAGA 
GTGTCGATCT 



GAGCTACCTA TCCCTGAGCC 
CTCGATGGAT AGGGACTCGG 



TGAAGGTCCA GATGCCAGCT 
ACTTCCAGGT CTACGGTCGA 
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gplOO(M) 



6901 



!0 



15 



20 



25 



'30 



35 



40 



45 



50 



6951 



7001 



7051 



7101 



7151 



7201 



7251 



7301 



7351 



7401 



7451 



-7501 



55* 7551 



CAATCATGTC TACGGAAAGT ATTACAGGTT CCCTGGGCCC CCTGCTGGAT 
GTTAGTACAG ATGCCTTTCA TAATGTCCAA GGGACCCGGG GGACGACCTA 
gplOO(M) 

GGTACAGCCA CCTTAAGGCT GGTGAAGAGA CAAGTCCCCC TGGATTGTGT 
CCATGTCGGT GGAATTCCGA CCACTTCTCT GTTCAGGGGG ACCTAACACA 

gpioo(M) 

Ir™?° GA TATGGTTCCT TTTCCGTCAC^tCTGGACATT^GTCCAGG^ 
AGACATAGCT ATACCAAGGA AAAGGCAGTG GGACCTGTAA CAGGTCCCAT 
gplOO(M) 

ri™^ TG ° CGAGATCCTG CAGGCTGTGC CGTCCGGTG A ^GGGGGATGCA 
AACPTTCACG GCTCTAGGAC GTCCGACACG GCAGGCCACT CCCCCTACGT 
gplO.O(M) 

TTTGAGCTGA CTGTGTCCTG CCAAGGCGGG^CTGCCCMGG^^ 
AAACTCGACT GACACAGGAC GGTTCCGCCC GACGGGTTCC TTCGGACGTA 
gplOO(M) 

^^ AT " CA TCGCCAGGGT <5CCAGCCCCC TGCCCAGCGG ""cTGTGCCAGC 
CCTCTAGAGT AGCGGTCCCA CGGTCGGGGG ACGGGTCGCC GACACGGTCG 
gplOO(M) 

CTGTGCTACC CAGCCCAGCC TGCCAGCTGG TTCTG^CCA^GATACTGAAG 
GACACGATGG GTCGGGTCGG ACGGTCGACC AAGACGTGGT CTATGACTTC 
gplOO(M) 

GGTGGCTCGG GGACATACTG CCTCAATGTG TCTCTGGCTG^ATACCAACAG 
CCACCGAGCC CCTGTATGAC GGAGTTACAC AGAGACCGAC TATGGTTGTC 
gplOO(M) 

CCTGGCAGTG GTCAGCACCC AGCTTATCAT GCCTGGTCAA^GAAGCAGGCC 
GGACCGTCAC CAGTCGTGGG TCGAAYAGTA CGGACCAGTT CTTCGTCCGG 
gplOO(M) 

TTGGGCAGGT TCCGCTGATC GTGGGCATCT TGCTGGTGT^GATGGCTGTG 
AACCCGTCCA AGGCGACTAG CACCCGTAGA ACGACCACAA CTACCGACAC 
gplOO(M) 

GTCCTTGCAT CTCTGATATA TAGGCGCAGA CTTATG^GC^MGACTTCTC 
CAGGAACGTA GAGACTATAT ATCCGCGTCT GAATACTTCG TTCTGAAGAG ' 
gplOO{M) 

CGTACCCCAG TTGCCACATA GCAGCAGTCA CTGGCTGCGT ^CTACCCCGCA 
GCATGGGGTC AACGGTGTAT CGTCGTCAGT GACCGACGCA GATGGGGCGT 
gplOO(M) 

TCTTCTGCTC TTGTCCCATT GGTGAGAACA GCCCCCTCCT "cAGTGGGCAG 
AGAAGACGAG AACAGGGTAA CCACTCTTGT CGGGGGAGGA GTCACCCGTC 
9P^(MM)^ • 42K promoter 

CAGGTCTGAT TTTTATTCTA GTTCAAAAAA ATATAAATGA^TTCACCATCT ' 
GTCCAGACTA AAAATAAGAT CAAGTTTTTT TATATTTACT AAGTGGTAGA 
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42K promoter 



5 


7601 


GATAGAAAAA AAATTTATTG GGAGAATATG ATAATATTTT 
CTATCTTTTT TTTAAATAAC CCTCTTATAC TATTATAAAA 
42K promoter 


GGGATTTCAA 
CCCTAAAGTT 
Mart-1 


10 


7 651 


AATTGAAAAT 
TTAACTTTTA 


ATATAATTAC AATATAAATC TAGACCACCA 
TATATTAATG TTATATTTAG ATCTGGTGGT 
Mart-1 


TGCCAAGAGA 
ACGGTTCTCT 


7701 


AGATGCTCAC 
TCTACGAGTG 


TTCATCTATG GTTACCCCAA GAAGGGGCAC 
AAGTAGATAC CAATGGGGTT CTTCCCCGTG 
Mart-1 


GGCCACTCTT 
CCGGTGAGAA 


15 


7751 


ACACCACGGC 
TGTGGTGCCG 


TGAAGAGGCC GCTGGGATCG GCATCCTGAC 
ACTTCTCCGG CGACCCTAGC CGTAGGACTG 
Mart-1. . 


AGTGATCCTG 
TCACTAGGAC 


20 


7801 


GGAGTCTTAC 
CCTCAGAATG 


TGCTC ATCGG... CTGTTGGTAT TGTAGAAG AC 
ACGAGTAGCC GACAACCATA ACATCTTCTG 
Mart-1 


GAAATGGATA 
CTTTACCTAT 


25 


7851. 


CAGAGCCTTG ATGGATAAAA GTCTTCATGT TGGCACTCAA TGTGCCTTAA 
GTCTCGGAAC TACCTATTTT CAGAAGTACA ACCGTGAGTT ACACGGAATT 
' % Mart-1 


• 

30 


7901 


CAAGAAGATG 
GTTCTTCTAC 


CCCACAAGAA GGGTTTGATC ATCGGGACAG 
GGGTGTTCTT CCCAAACTAG TAGCCCTGTC 
Mart-1 


CAAAGTGTCT 
GTTTCACAGA 


7951 


CTTCAAGAGA 
GAAGTTCTCT 


AAAACTGTGA ACCTGTGGTT CCCAATGCTC 
TTTTGACACT TGGACACCAA GGGTTACGAG 
Mart-1 


CACCTGCTTA 
GTGGACGAAT 


35 


8001' 


TGAGAAACTC 
ACTCTTTGAG 


TCTGCAGAAC AGTCACCACC ACCTTATTCA 
AGACGTCTTG TCAGTGGTGG TGGAATAAGT 
sE/L Promoter 


CCTTAATCTA 
GGAATTAGAT 


40 


0051 


GAGTCGACCT GCAGGCATGC AAAAATTGAA ATTTTATTTT 
CTCAGCTGGA CGTCCGTACG TTTTTAACTT TAAAATAAAA 
sE/L Promoter 


TTTTTTTTGG 
AAAAAAAACC 








Mage 1-3 minigene 




45 


8101 


AATATAAATA 
TTATATTTAT 


ATGGAGTCCT TGCAGCTGGT CTTTGGCATT 
TACCTCAGGA ACGTCGACCA GAAACCGTAA 
Mage 1-3 minigene 


GACGTGAAGG 
CTGCACTTCC 


50 


8151 


AAGCAGACCC CACCGGCCAC TCCTATGTCC TTGTCACCTG CCTAGGTCTC 
TTCGTCTGGG GTGGCCGGTG AGGATACAGG AACAGTGGAC GGATCCAGAG 
Mage 1-3 minigene 




8201 


TCCTATGATG 
AGGATACTAC 


GCAATAAGCG TAAAGAAGTG GACCCCATCG 
CGTTATTCGC ATTTCTTCAC CTGGGGTAGC 


GCCACTTGTA 
CGGTGAACAT 



55 
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Mage 1-3 minigene C5 Left Arm 



0251 CTAGTTTTTA TCCCGGGTTT • TTATGACTAG TTAATCACGG CCGCTTATAA 
GATCAAAAAT AGGGCCCAAA AATACTGATC AATTAGTGCC GGCGAATATT * 
5 C5 Left Arm 





8301 


AGATCTAAAA TGCATAATTT CTAAATAATG AAAAAAAAGT ACATCATGAG 






TCTAGATTTT ACGTATTAAA 


GATTTATTAC 


TTTTTTTTCA 


TGTAGTACTC 


10 






C5 


Left Arm 






B351 


CAACGCGTTA 


GTATATTTTA 


CAATGGAGAT 


TAACGCTCTA 


TACCGTTCTA 






GTTGCGCAAT 


CATATAAAAT 


GTTACCTCTA 


ATTGCGAGAT 


ATGGCAAGAT 








C5 


Left Arm 






15 


8401 


TGTTTATTGA 


TTCAGATGAT 


GTTTTAGAAA 


AGAAAGTTAT 


TGAATATGAA 






ACAAATAACT 


AAGTCTACTA 


CAAAATCTTT 


TCTTTCAATA ACTTATACTT 








C5 


Left Arm 








8451 


AACTTTAATG AAGATGAAGA 


TGACGACGAT 


GATTATTGTT 


GTAAATCTGT 






TTGAAATTAC 


TTCTACTTCT 


ACTGCTGCTA CTAATAACAA 


CATTTAGACA 








C5 


Left Arm 








8501 


TTTAGATGAA 


GAAGATGACG 


CGCTAAAGTA 


TACTATGGTT 


ACAAAGTATA 






AAATCTACTT 


CTTCTACTGC 


GCGATTTCAT 


ATGATACCAA 


TGTTTCATAT 


25 






C5 


Left Arm 








8551 


AGTCTATACT 


ACTAATGGCG 


ACTTGTGCAA 


GAAGGTATAG 


TATAGTGAAA 






TCAGATATGA 


TGATTACCGC 


TGAACACGTT 


CTTCCATATC 


ATATCACTTT 








C5 


Left Arm 






30 


0601 


ATGTTGTTAG 


ATTATGATTA 


TGAAAAACCA 


AATAAATCAG 


ATCCATATCT 






TACAACAATC 


TAATACTAAT 


ACTTTTTGGT 


TTATTTAGTC 


TAGGTATAGA 








C5 


Left Arm 






35 


0651 


AAAGGTATCT 


CCTTTGCACA 


TAATTTCATC 


TATTCCTAGT 


TTAGAATACT 






TTTCCATAGA 


GGAAACGTGT 


ATTAAAGTAG 


ATAAGGATCA 


AATCTTATGA 








C5 


Left Arm 








8701 


TTTCATTATA 


TTTGTTTACA 


GCTGAAGACG 


AAAAAAATAT 


• ATCGATAATA. 


40 




AAAGTAATAT 


AAACAAATGT 


CGACTTCTGC 


TTTTTTTATA 


TAGCTATTAT 








• C5 


Left Arm 








8751 


GAAGATTATG 


TTAACTCTGC 


TAATAAGATG 


AAATTGAATG 


AGTCTGTGAC 






CTTCTAATAC AATTGAGACG 


ATTATTCTAC 


TTTAACTTAC 


TCAGACACTG 


45 




C5 Left Arm 










8801 


TGCAGCCAAG 


CTTGGCACTG 


GCCGTCGTTT 


TACAACGTCG 


TGACTGGGAA 






ACGTCGGTTC 


GAACCGTGAC 


CGGCAGCAAA 


ATGTTGCAGC 


ACTGACCCTT 




8851 


AACCCTGGCG 


TTACCCAACT 


TAATCGCCTT 


GCAGCACATC 


CCCCTTTCGC 


50 




TTGGGACCGC 


AATGGGTTGA- 


ATTAGCGGAA 


CGTCGTGTAG 


GGGGAAAGCG 




8901 


CAGCTGGCGT 


AATAGCGAAG 


AGGCCCGCAC 


CGATCGCCCT 


TCCCAACAGT 






GTCGACCGCA 


TTATCGCTTC 


TCCGGGCGTG 


GCTAGCGGGA 


AGGGTTGTCA 




8951 


TGCGCAGCCT 


GAATGGCGAA. 


TGGCGCCTGA 


TGCGGTATTT 


TCTCCTTACG 






ACGCGTCGGA 


CTTACCGCTT 


ACCGCGGACT 


ACGCCATAAA 


AGAGGAATGC 


55 ' 


9001 


CATCTGTGCG 


GTATTTCACA 


CCGCATATGG 


TGCACTCTCA 


GTACAATCTG 






GTAGACACGC 


CATAAAGTGT 


GGCGTATACC 


ACGTGAGAGT 


CATGTTAGAC 
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9051 CTCTGATGCC GCATAGTTAA GCCAGCCCCG ACACCCGCCA ACACCCGCTG 

GAGACTACGG CGTATCAATT CGGTCGGGGC TGTGGGCGGT TGTGGGCGAC 
9101 ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG CATCCGCTTA CAGACAAGCT 

TGCGCGGGAC TGCCCGAACA GACGAGGGCC GTAGGCGAAT GTCTGTTCGA 
9.151 GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC CGTCATCACC 

CACTGGCAGA GGCCCTCGAC GTACACAGTC TCCAAAAGTG GCAGTAGTGG 
9201 GAAACGCGCG AGACGAAAGG GCCTCGTGAT ACGCCTATTT TTATAGGTTA 

CTTTGCGCGC TCTGCTTTCC CGGAGCACTA TGCGGATAAA AATATCCAAT 
9251 ATGTCATGAT AATAATGGTT TCTTAGACGT CAGGTGGCAC TTTTCGGGGA 

TACAGTACTA TTATTACCAA AGAATCTGCA GTCCACCGTG AAAAGCCCCT 
9301 AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT 

TTACACGCGC CTTGGGGATA AACAAATAAA AAGATTTATG TAAGTTTATA 
9351 GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA 

CATAGGCGAG TACTCTGTTA TTGGGACTAT TTACGAAGTT ATTATAACTT 

Amp(R) . 



9401 AAAGGAAGAG TATGAGTATT CAACATTTCC GTGTCGCCCT TATTCCCTTT 
TTTCCTTCTC ATACTCATAA GTTGTAAAGG CACAGCGGGA ATAAGGGAAA 

Amp (R) 



94 51 TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT CACCCAGAAA CGCTGGTGAA 
AAACGCCGTA AAACGGAAGG ACAAAAACGA GTGGGTCTTT GCGACCACTT 

Amp (R) 



9501 AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT TACATCGAAC 

TCATTTTCTA CGACTTCTAG TCAACCCACG TGCTCACCCA ATGTAGCTTG 

Amp(R) 

9551 TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC CGAAGAACGT 

ACCTAGAGTT GTCGCCATTC TAGGAACTCT CAAAAGCGGG GCTTCTTGCA 
. • Amp (R) 

9601 TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG CGGTATTATC 

AAAGGTTACT ACTCGTGAAA ATTTCAAGAC GATACACCGC GCCATAATAG 

Amp{R> 



9651 CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA CACTATTCTC 
GGCATAACTG CGGCCCGTTC TCGTTGAGCC AGCGGCGTAT GTGATAAGAG 

Amp (R) 



9701 AGAATGACTT GGTTGAGTAC TCACCAGTCA CAGAAAAGCA TCTTACGGAT 
TCTTACTGAA CCAACTCATG AGTGGTCAGT GTCTTTTCGT AGAATGCCTA 

Amp (R) 



9751 GGCATGACAG TAAGAGAATT ATGCAGTGCT GCCATAACCA TGAGTGATAA . 
CCGTACTGTC ATTCTCTTAA TACGTCACGA CGGTATTGGT ACTCACTATT 

Amp (R) 



9801 CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG AAGGAGCTAA 
GTGACGCCGG TTGAATGAAG ACTGTTGCTA GCCTCCTGGC TTCCTCGATT 

Amp {R) 



9 851 CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG 
GGCGAAAAAA CGTGTTGTAC CCCCTAGTAC ATTGAGCGGA ACTAGCAACC 
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Amp(R) 

, 9901 sS^S^^. 

Amp ( R ) 

9951 ^SsS^ 
'° 10001 Mra§3Kraa 

23 10151 

10251 CTCACTGATT AAGCATTGGT ~AACTrTr irt 

GAGTGACTAA TTCGTAACCA iJSSSS ^ GTTTAC TCATATATAC ' 
1030! TTTAGATTGA TTTAAAACT7 SSESIS AGTATft ™ 

AAATCTAACT AAATTTTraa rCa*,.. TTAAAAGGAT CTAGGTGAAG 
35 10351 AICCTTTtS IS5SS! SS™ AATTTTCCTA GATCCACTT? 

TAGGAAAAAC TATTaSSa ^TTAACGTG AGTTTTCGTT 

10401 CCACTGAGCG TCAGACCCCG GGAATTGCAC TCAAAAGCAA 

GGTGACTCGC AGTcJggS 55S5SJ SS?*"* TCTTG ^ATC 
10-151 CTTTTTTTCT GCGCGTAATC allJ 00 ™^ AGA *CTCTAG ' 

40 GAAAAAAAGA CGCGCATTAG aSSH^ ^ CMAMA ACCACCGCTA. ■ 

105O1 • CCAGCGGTGG TTTgStGCC SSEXSS TTTGTTTTTT TGGTGGCGAT 
. GGTCGCCACC AaTSUcGG SSSKS SiEST* TTTTTC ^AA 
10551 GGTAACTGGC TTCAGCAGAG crrArl™^ GATGGTTGAG AAAAAGGCTT. 

CCATTGACCG AaSosJS? 22SI££ AAATACTGTC CTTCTAGTGT 
« 10601 AGCCGTAGTT AGGCcSIc TTTATGACAG GAAGATCACA " 

TCGGCATCAA TCCGGTGGTG SS2SJ2£f CTGTAGCAC C GCCTACATAC 
10651 CTCGCTCTGC J5SSSS5 S2 SSE 6 "* CGG ^GTATG 
GAGCGAGACG ATTAGGACAA SrSl GCTGCCAGTG GCGATAAGTC 
10701 GTGTCTTACC GGGtoSaS CGACGGTCAC CGCTATTCAG 

5 ° CACAGAATGG CCCAACCTgI S52SS?2 GTTACCGGAT AAGGGGCAGC 

10751 GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT ^ TGG ^ G ^^^ 
CCAGCCCGAC TTGCCCCCCA Lrncr^^ AGCCCAGCTT GGAGCGAACG 
10801 ACCTACACCG AACTGAGaS SSS CCT <*CTTGC 

cgaagggc. ccSSSS SSSS 2SS5 SSSSSSS 
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10901 


GAACAGGAGA 




CTTGTCCTCT 


1 0951 

X V 5" mi X 


TATAGTCCTG 




ATATCAGGAC 


liuuJL 


ATGCTrGTCA 




TACGAGCAGT 


X X V J X 


TTTTACGGTT 




AAAATGCCAA 


1 1 1 m 

IX 1 U J. 






. CGCAATAGGG 


•11151 


TGATACCGCT 




ACTATGGCGA 


11201 


AGGAAGCGGA 




TCCTTCGCCT 


11251 


CCGATTCATT 




GGCTAAGTAA 


11301 


CAGTGAGCGC 




GTCACTCGCG 


11351 


AGGCTTTACA 




TCCGAAATGT 


11101 


GGATAACAAT 




CCTATTGTTA 


11451 


TTGCGGCCGC 



GCGCACGAGG GAGCTTCCAG 
CGCGTGCTCC CTCGAAGGTC 
TCGGGTTTCG CCACCTCTGA 
AGCCCAAAGC GGTGGAGACT 
GGGGGGCGGA GCCTATGGAA 
CCCCCCGCCT CGGATACCTT 
CCTGGCCTTT TGCTGGCCTT 
GGACCGGAAA ACGACCGGAA 
CTGATTCTGT GGATAACCGT 
GACTAAGACA CCTATTGGCA 
CGCCGCAGCC GAACGACCGA 
GCGGCGTCGG CTTGCTGGCT 
AGAGCGCCCA ATACGCAAAC 
TCTCGCGGGT TATGCGTTTG 
AATGCAGCTG GCACGACAGG 
TTACGTCGAC CGTGCTGTCC 
AACGCAATTA ATGTGAGTTA 
TTGCGTTAAT TACACTCAAT 
CTTTATGCTT CCGGCTCGTA 
GAAATACGAA GGCCGAGCAT 
TTCACACAGG AAACAGCTAT 
AAGTGTGTCC TTTGTCGATA 
AATTCAACGC CGGCGTTAAG 



GGGGAAACGC CTGGTATCTT 
CCCCTTTGCG GACCATAGAA 
CTTGAGCGTC GATTTTTGTG 
GAACTCGCAG CTAAAAACAC 
AAACGCCAGC AACGCGGCCT 
TTTGCGGTCG TTGCGCCGGA 
TTGCTCACAT GTTCTTTCCT 
AACGAGTGTA CAAGAAAGGA 
ATTACCGCCT TTGAGTGAGC 
TAATGGCGGA AACTCACTCG 
GCGCAGCGAG TCAGTGAGCG 
CGCGTCGCTC AGTCACTCGC 
CGCCTCTCCC CGCGCGTTGG 
GCGGAGAGGG GCGCGCAACC 
TTTCCCGACT GGAAAGCGGG 
AAAGGGCTGA CCTTTCGCCC 
GCTCACTCAT TAGGCACCCC 
CGAGTGAGTA ATCCGTGGGG 
TGTTGTGTGG AATTGTGAGC 
ACAACACACC ' TTAACACTCG 
GACCATGATT ACGAATTGAA 
CTGGTACTAA TGCTTAACTT 



BWSDOaD: <WO _200SO2637O*2J_> 



47/53 



10 



15 



WO 2005/026370 

PCT/US2004/02875I 



NY-ESCM 

. £ z z % 2 z z r r s ~ - - »> »» - 

«y T. cly „u Ala Gly „. ^° 0 ? = " P '° ^ «V »s» „. 01y 
«* «. «. „.■ ser "* p ^ «" "» »ro A,„ My », 
»i. Sly My Ala «. Sar T ' ^ "* "° «*» «» •» 

. M. Thr P„ „at a , , j ^ ^ "» "» >* «. Mat ,„ phe 
»•* «y A.„ a. Laa J „! *" teU Le " 01 " «» Thr val 

- «. Laa s „ n . S « l r r ua an> «■ *» 

£ na * Ola cya ». « £ £ * * - - « »aa 
Gly Gin Arg- Arg a cln Pr <> Pro set 



VS0OCftt <WO 2O05028370A2. 



48/53 



WO 2005/02M70 



PCT/US2004/028751 



FIGURE 6C 
TRP-2 

Met Ser Pro Leu Trp Trp Gly Phe Leu Leu Ser Cys Leu Gly Cys Lys lie 
Leu Pro Gly Ala Gin Gly Gin Phe Pro Arg Val Cys Met Thr Val Asp Ser 
Leu Val Asn Lys Glu Cys Cys Pro Arg Leu -Gly Ala Glu Ser Ala. Asn Val 
Cys Gly Ser Gin Gin Gly Arg Gly Gin Cys Thr Glu Val Arg Ala Asp Thr 
Arg Pro Trp Ser Gly Pro Tyr He Leu Arg Asn Gin Asp Asp Arg Glu Leu 
Trp Pro Arg Lys Phe Phe His Arg Thr Cys Lys Cys Thr Gly Asn Phe Ala 
Gly Tyr Asn Cys Gly Asp Cys Lys Phe Gly Trp Thr Gly Pro Asn Cys Glu 
Arg Lys Lys Pro Pro Val He Arg Gin Asn He His Ser Leu Ser Pro Gin 
Glu Arg Glu Gin Phe Leu Gly Ala Leu Asp Leu Ala Lys Lys Arg Val His 
Pro Asp Tyr Val He Thr Thr Gin His Trp Leu Gly Leu Leu.. Gly Pro Asn 
Gly Thr Gin Pro Gin Phe Ala Asn Cys Ser Val Tyr Asp Phe Phe Val Trp 
Leu His Tyr Tyr Ser Val Arg Asp Thr Leu Leu Gly Pro Gly Arg Pro Tyr 
Arg Ala He Asp Phe Ser His Gin Gly Pro Ala Phe Val Thr Trp His Arg 
Tyr His Leu Leu Cys Leu Glu Arg Asp Leu Gin Arg Leu He •Gly Asn Glu 
Ser Phe Ala Leu Pro Tyr Trp Asn Phe Ala Thr Gly Arg' Asn Glu Cys Asp 
Val Cys Thr Asp Gin Leu Phe Gly Ala Ala Arg Pro Asp Asp Pro Thr Leu 
lie Ser Arg Asn Ser Arg Phe Ser Ser Trp Glu Thr Val Cys Asp Ser Leu 
Asp Asp Tyr Asn His Leu Val Thr Leu Cys Asn Gly Thr Tyr Glu Gly Leu 
Leu Arg Arg Asn Gin Met Gly Arg Asn Ser Met Lys Leu Pro Thr Leu Lys 
Asp He Arg Asp Cys LeU Ser Leu Gin Lys Phe Asp Asn Pro Pro Phe Phe 
Gin Asn Ser Thr Phe Ser Phe Arg Asn Ala Leu Glu Gly Phe Asp Lys Ala 
• Asp Gly Thr Leu Asp Ser Gin Val Met Ser Leu His Asn Leu Val His Ser 
Phe Leu Asn Gly Thr Asn Ala Leu Pro His Ser Ala Ala Asn Asp Pro He 
Phe Val Val Leu His. Ser Phe Thr Asp Ala He Phe Asp Glu Trp Met. Lys 
Arg Phe Asn Pro Pro Ala Asp Ala Trp Pro Gin Glu Leu Ala Pro He Gly 
His Asn Arg Met Tyr Asn Met Val Pro Phe Phe Pro Pro Val Thr Asn -Glu 
Glu Leu Phe Leu Thr Ser Asp Gin Leu Gly Tyr Ser Tyr Ala He Asp Leu 
Pro Val Ser Val Glu Glu Thr Pro Gly Trp Pro Thr Thr Leu Leu Val Val 
Met Gly Thr Leu Val Ala Leu Val Gly Leu Phe Val Leu Leu Ala Phe Leu 
• Gin Tyr Arg Arg Leu Arg- Lys Gly Tyr Thr Pro Leu Met Glu Thr His Leu 
Ser Ser Lys Arg Tyr Thr Glu Glu Ala 
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FIGURE 6D 
gplOO and gplOOM 
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FIGURE 6E 
MART-1 

Met Pro Arg Glu Asp Ala His Phe lie Tyr Gly Tyr Pro 
Lys Lys Gly His Gly His Ser. Tyr Thr Thr Ala. Glu Glu 

5 Ala Ala Gly lie Gly lie Leu Thr Val He Leu Gly Val 

. Leu Leu Leu He Gly Cys Trp Tyr Cys Arg- Arg Arg Asn 
Gly Tyr Arg Ala Leu Met Asp Lys Ser Leu His Val Gly 
Thr Gin Cys Ala Leu Thr Arg Arg Cys Pro Gin -Glu Gly 
Phe Asp His Arg Asp Ser Lys Val Ser Leu Gin Glu Lys. 

10 Asn Cys Glu Pro Val Val Pro Asn Ala Pro Pro- Ala Tyr 

Glu Lys Leu Ser Ala Glu Gin Ser Pro Pro Pro Tyr" Ser 
Pro 
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FIGURE 6F 
MAGE-1 

Met Ser Asp Asn Lys Lys Pro Asp Lys Ala His Ser n « 

ASD Gly Asd Glv 2io„ * r Gly Ser G1 y Q ly 

===r==========r= 

S=. v.! iop Glu % Mu « »» ^ »et s.r 

<*P >s P „ p >sp Asp Mp , ' £ «■ »P *P Txx ^ 

Glu Glu Glu Glu Leu ri„ »- , P P Glu As P As P 

«. «: r r z :; z z z r z - ci ° 

«T ». =!, «. cy . Ala c„. va. P rp £ 1 £ L " » 
■*» =1. v.x L ys cy. to9 „« „. ^ phe 21 1. T Ar9 

l. u v.! s „ Ito Pt0 wl Jon pro ^ ^ ^ i t 0 ? r *" "* 

oiu a.„ „, „ sp Glu 01u val sla y * " 2 Z T 

«. - «. «. MP «. slu Met «,, » - «■ - «» 



FIGURE fir: 
MAGE-3 
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FIGURE 6H 
B7.1 

mghtrrqgts pskcpylnf f qllvlaglsh fcsgvihvtk .evkevatlsc ghhvsveela 

qtriywqkek kmvltminsgd mniwpeyknr tifdi'tnnls ivilalrpsd e'gtyecvvlk 

yekdafkreh laevtlsvka dfptpsisdf eiptsnirri icstsggfpe. phlswlenge 

elnainttvs qdpetelyav sskldfnmtt nhsfmcliky ghlrvnqtfn wnttkqehfp 

dnllpswait lisvngifvi ccltycfapr crerrrnerl rresvrpv 

FIGURE 61 
LFA-3 

mvagsdagra Igvlsvvcll hcfgfiscfs qqiygvvygn vtfhvpsnvp Ikevlwkkqk 
dkvaelense frafssfknr vyldtvsgsl tiynltssde deyemespni tdtmkfflyv 
leslpsptlt caltngsiev qcmipehyns hrglimyswd cpmeqckrns tsiyfkmend 
lpqkiqctls nplfnttssi ilttcipssg hsrhryalip iplavittci vlymngilkc 
drkpdrtnsn 

FIGURE 63 
ICA1VM* 

mapssprpal pallvllgal fpgpgnaqts vspskvilpr ggsvlvtcst scdqpkllgi 
etplpkkell Ipgnnrkvye lsnvqedsqp mcysncpdgq staktfltvy wtpervelap 
Ipswqpvgkn itlrcqvegg apranltvvl Irgekelkre pavgepaevt ttvlvrrdhh 
ganfscrtel dlrpqglelf entsapyqlq tfvlpatppq lvsprvlevd tqgtvvcsld 
glfpvseaqv hlalgdqrln ptvtygndsf sakasvsvta edegtqrltc ayilgnqsqe 
tlqtvtiysf papnviltkp evsegtevtv kceahprakv tlngvpaqpl gpraqlllka 
tpedngrsfs csatlevagq lihknqtrel rvlygprlde rdcpgnwtwp ensqqtpmcq 
awgnplpelk clkdgtfplp igesvtvtrd legtylcrar stqgevtrev tvnvlsprye 
iviitvvaaa vimgtaglst ylynrqrkik kyrlqqaqkg tpmkpntqat pp 

*mature sequence begins at residue 28 (q) 
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